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Abstract. This survey of methods surrounding lattice point methods for binomial 
ideals begins with a leisurely treatment of the geometric combinatorics of binomial 
primary decomposition. It then proceeds to three independent applications whose 
motivations come from outside of commutative algebra: hypergeometric systems, 
combinatorial game theory, and chemical dynamics. The exposition is aimed at 
students and researchers in algebra; it includes many examples, open problems, and 
elementary introductions to the motivations and background from outside of algebra. 
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Introduction 



Binomial ideals in polynomial rings over algebraically closed fields admit binomial 
primary decompositions: expressions as intersections of primary binomial ideals. The 
algebra of these decompositions is governed by the geometry of lattice points in poly- 
hedra and related lattice-point combinatorics arising from congruences on commuta- 
tive monoids. The treatment of this geometric combinatorics is terse at the source 
|DMM10] . Therefore, a primary goal of this exposition is to provide a more leisurely 
tour through the relevant phenomena; this is the concern of Sections [U El and El 

That the geometry of congruences should govern binomial primary decomposition 
was a realization made in the context of classical multivariate hypergeometric se- 
ries, going back to Horn, treated in Section HI Lattice-point combinatorics related to 
monoids and congruences has recently been shown relevant to the theory of combi- 
natorial games, and is sure to play a key role in algorithms for computing rational 
strategies and misere quotients, as discussed in Section [5j Finally, binomial commu- 
tative algebra is central to a long-standing conjecture on the dynamics of chemical 
reactions under mass-action kinetics. The specifics of this connection are briefly out- 
lined in Section EJ along with the potential relevance of combinatorial methods for 
binomial primary decomposition. Limitations of time and space prevented the in- 
clusion of algebraic statistics in this survey; for an exposition of binomial aspects of 
Markov bases and conditional independence models, such as graphical models, as well 
as applications to phylogenetics, see |DSS09] and the references therein. 

Sections [U El and [3] are complete in the sense that statements are made in full 
generality, and precise references are provided for the details of any argument that is 
only sketched. In contrast, Sections HI [5J and|6]are more expository. The results there 
are sometimes stated in less than full generality — but still mathematically precisely — 
to ease the exposition. In addition, Sections HJ[5j and[6]are independent of one another, 
and to a large extent independent of Sections (U El and [21 as well; readers interested in 
the applications should proceed to the relevant sections and refer back as necessary. 

Acknowledgements. I am profoundly grateful to the organizers and participants of 
the International School on Combinatorics at Sevilla, Spain in January 2010, where 
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these notes were presented as five lectures. That course was based on my Abel Sym- 
posium talk at Voss, Norway in June 2009; I am similarly indebted to that meeting's 
organizers. Thanks also go to my coauthors, from whom I learned so much while 
working on various projects mentioned in this survey. Funding was provided by NSF 
CAREER grant DMS-0449102 = DMS-1014112 and NSF grant DMS-1001437. 

Part 1. Theory 

1. AFFINE SEMIGROUPS AND PRIME BINOMIAL IDEALS 

1.1. Affine semigroups. Let Z d C M. d denote the integer points in a real vector 
space of dimension d. Any integer point configuration 

A = {ai, . . . ,a„} C Z d < — ► A = 

can be identified with a d x n integer matrix. 

Definition 1.1. A monoid is a set with an associative binary operation and an 
identity element. An affine semigroup is a monoid that is isomorphic to 

NA = N{ai, . . . , a n ,} = {ciai H h c n a„ | Cl , . . . , c n e N} 

for some lattice point configuration A C Z d . 

Thus a monoid is a group without inverses. Although a semigroup is generally 
not required to have an identity element, standard terminology from the literature 
dictates that an affine semigroup is a monoid, and in particular (isomorphic to) a 
finitely generated submonoid of an integer lattice Z d for some d. 

Example 1.2. The configuration a 



ai 



e Z 1 



dxn 



A 



1111 
12 3 



< — > 




in Z 2 , drawn as solid dots in the plane, generates the affine semigroup comprising all 
lattice points in the real cone bounded by the thick horizontal ray and the diagonal 
ray. This example will henceforth be referred to as "0123". 
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Example 1.3. A point configuration is allowed to have repeated elements, such as 




in Z 2 , which generates the affine semigroup NA = N 2 of all lattice points in the 
nonnegative quadrant. This example will henceforth be referred to as "Jin". 

Example 1.4. It will be helpful, later on, to have a three-dimensional example ready. 
Consider a square at height 1 parallel to the horizontal plane. It can be represented 
as a matrix and point configuration in Z 3 as follows: 




The affine semigroup NA comprises all of the lattice points in the real cone generated 
by the vertices of the square. 

In all of these examples, the affine semigroups are normal: each one equals the set 
of all lattice points from a rational polyhedral cone. General affine semigroups need 
not be normal, though they always comprise "most" of the lattice points in a cone. 

Example 1.5. In Example II. 2\ the outer columns of the matrix, namely [J] and [*] , 
correspond to the extremal rays; they therefore generate the rational polyhedral cone 
whose lattice points constitute the 0123 affine semigroup. Consequently, the configu- 
ration [HI] generates the same rational polyhedral cone, but the affine semigroup it 
generates is different — and not normal — because the point [j] does not lie in it, even 
though the configuration still generates Z 2 . 

The geometry of binomial primary decomposition is based on the sort of geometry 
that arises from the projection determined by A. To be more precise, A determines 
a monoid morphism Z d N n . This morphism can be expressed as the restriction of 
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the group homomorphism (the linear map) 7L d Z n induced by A. The diagram is 
as follows: 

a, < — i Bj 

n 

d A 

Z d i 
Z d i 

ni 

R d i 

The kernel L of the homomorphism Z d Z n induced by A is a saturated lattice 
in Z n , meaning that Z n /L is torsion-free, or equivalently that L = RL D Z n , where 
RL = R ®z L is the real subspace of R n generated by L. 

It makes little sense to say that the monoid morphism Z d 4— N n has a kernel: 
it is often the case that L fl N n = {0}, even when the monoid morphism is far 
from injective. However, when Z d ^— N n fails to be injective, the fibers admit clean 
geometric descriptions, inherited from the fact that the fibers of the vector space map 
R d <- W 1 are the cosets of RL in R n . 

Definition 1.6. A polyhedron in a real vector space is an intersection of finitely many 
closed real half-spaces. 

This survey assumes basic knowledge of polyhedra. Readers for whom Definition [OJ] 
is not familiar are urged to consult |Zie95t Chapters 0,1, and 2]. 

Lemma 1.7. The fiber of the monoid morphism Z <— N n over a given lattice point 
a G Z d is the set 

F a = rc n (u + L) = N n n P a 

of lattice points in the polyhedron 

P a = (u + EL) n M^ 

for any vector u G Z d satisfying Au = a, where L = ker A. 

This description is made particularly satisfying by the fact that the polyhedra for 
various a G Z d are all related to one another. 

Example 1.8. When A = [1 1 1] is the "coordinate-sum" map N ^— N 3 , the poly- 
hedra P a for a G N are equilateral triangles, the lattice points in them corresponding 



ni 

z n < — l <- o 
ni ni 
R n < — rl <— o 
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to the monomials of total degree a in three variables: 



fibers of A = [1 1 1] 



i — > 




The polyhedra in Example 11.81 are all scalar multiples of one another, but this 
phenomenon is special to codimension 1. In general, when n — d > 1, the polyhedra 
P a in the family indexed by a G NA have facet normals chosen from the same fixed 
set of possiblities — namely, the images in L* of the dual basis vectors of {IT 1 )* — so 
their shapes feel roughly similar, but faces can shrink or disappear. 

Example 1.9. In the case of 0123, a basis for the kernel ker/L can be chosen so that 
the inclusion Z n i— ker A is given by the matrix 



B 



1 

-2 1 

1 -2 

1 





so that, for example, the depicted polytopes P a and Pp for a = [® 2 ] and = [L] 
both have outer normal vectors that are the negatives of the rows of B. Moving to 
P 1 for 7 = [^1 would shrink the bottom edge entirely. The corresponding fibers F a , 
F/3, and _F 7 comprise the lattice points in these polytopes. 

1.2. AfRne semigroup rings. 

Definition 1.10. The affine semigroup ring of NA over a field Ik is 

k[NA] = k • t a , 

a subring of the Laurent polynomial ring 

k[Z d ] = k[tf\. 

in which 

t Q = t? 1 ■ ■ ■ t°, d and 



/-ill 
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The definition could be made with k an arbitrary commutative ring, but in fact the 
case we care about most is k = C, the field of complex numbers. The reason is that 
the characteristic zero and algebraically closed hypotheses enter at key points; these 
notes intend to be precise about which hypotheses are needed where. 

Definition 1.11. Denote by tta the surjection 

TV A 

k[NA] «- k[x] 
ij * A i x% 

onto the affine semigroup ring k[NA] from the polynomial ring k[x] := k[x\ . . . , x n ]. 

The next goal is to calculate the kernel I a = ker(7Tyi). To do this it helps to note 
that both k[NA] and the polynomial ring are graded, in the appropriate sense. 

Definition 1.12. Let A G Z dxn , so ZA C Z d is a subgroup. A ring R is A-graded if 
R is a direct sum of homogeneous components 

R = R a , such that R a Rp Q Ra+p- 

aeZA 

An ideal in an A-graded ring is A-graded if it is generated by homogeneous elements. 

Example 1.13. The affine semigroup ring k[NA] is A-graded, with k[NA] Q = k-t a if 
a G NA, and k[NA] Q = otherwise. The polynomial ring k[x] is also A-graded, with 

k[x] Q = k • {F a }, 

the vector space spanned by the fiber F a from Lemma 11.71 

Proposition 1.14. The kernel of the surjection tta from Definition \l.ll\ is 

I A = (x u - x v | u, v G N n and Au = Av) 
= (x u - x v I u, v G N n and u - v G ker A). 

Proof. The "D" containment follows simply because 7Tyi(x u ) = x Au . The reverse 
containment uses the A-grading: in the ring R = k[x]/(x u — x v | Au = Av), the 
dimension of the image of k[x] a as a vector space over k is either or 1 since Au = 
Av = a if u and v lie in the same fiber F a . On the other hand, R -» k[NA] Q maps 
surjectively onto the affine semigroup ring by the "D" containment already proved. 
The surjection must be an isomorphism because, as we noted in Example 11.131 k[NA] a 
has dimension 1 whenever F a is nonempty. □ 

Corollary 1.15. The toric ideal Ia is prime. 



Proof. k[NA] is an integral domain, being contained in k[Z d ]. 



□ 
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Example 1.16. In the 0123 case, using variables {a, b, c, d} instead of {xi,X2, £3, £4}, 



.4 



1 





and B 



1 

-2 
1 





1 

-2 
1 



[ac 



b 2 , bd — c 2 , ad — be), 



where ker A is the image of B in Z 4 . The presence of the binomials ac — b 2 and bd — c 2 
in I a translate the statement "the columns of the matrix B lie in the kernel of A" . 
However, note that Ia is not generated by these two binomials; it is a complicated 
problem, in general, to determine a minimal generating set for I a- 

Example 1.17. In the H°° case, using variables {a, b, c, d} instead of {x±, x 2 , x 3 , X4}, 



A 



1 







1 1 



and B 



1 

-1 
1 




1 

-1 

1 



(ac — b,c — d). 



where again ker A is the image of B in Z 4 . In this case, I a is indeed generated by 
two binomials corresponding to a basis for the kernel of A, but not the given basis 
appearing as the columns of B. In Section HI we shall be interested in the ideal 
generated by the two binomials corresponding to the columns of B. 

Example 1.18. In the square cone case, using {a,b,c,d} instead of {xi, x 2 , x 3 , x 4 }, 



.4 



1 



1 1 



and B 



1 

-1 
-1 
1 



(ad — be) , 



where ker A is the image of B. In the codimension 1 case, when ker A has rank 1, the 
toric ideal is always principal, just as any codimension 1 prime ideal in k[x] is. 

1.3. Prime binomial ideals. At last it is time to define binomial ideals precisely. 

Definition 1.19. An ideal / C k[x] is a binomial ideal if it is generated by binomials 

x u - Ax v with u, v e N n and A G k. 

Note that A = is allowed: monomials are viable generators of binomial ideals. This 
may seem counterintuitive, but it is forced by allowing arbitrary nonzero constants A, 
and in any case, even ideals generated by differences of monomials ("pure-difference 
binomials") have associated primes containing monomials. 



Example 1.20. The ideal 



y 2 , x 3 



2y 2 ) C k [#,?/] is generated by binomials but 



equals the monomial ideal (x 3 , y 2 ), no matter the characteristic of k. Worse, the ideal 
/ = (x 2 — xy, xy — 2y 2 ) C k[x, y] is generated by "honest" binomials that are not linear 
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combinations of monomials in 7, and yet I contains monomials, because I contains 
both of x 2 y — xy 2 and x 2 y — 2xy 2 , so the monomials x 2 y and xy 2 lie in I. 

Example 1.21. The pure- difference binomial ideal I = (x 2 — xy,xy — y 2 ) C h[x, y] 
has a monomial associated prime ideal (x, y), since 7 = (x — y) Fl (x 2 , y). 

In general, which binomial ideals are prime? We have seen that toric ideals I a are 
prime, but for binomial primary decomposition in general it is important to know 
all of the other binomial primes, as well. The answer was given by Eisenbud and 
Sturmfels [ES961 Corollary 2.6]. 

Theorem 1.22. When k is algebraically closed, a binomial ideal I C k[a?i, . . . , x m ] 
is prime if and only if it is the kernel of a surjective A-graded homomorphism n : 
k[x] -» k[NA] in which the variables are homogeneous. 

The surjection ir in Theorem 11.221 need not equal tta- For example, vr(xj) = 
is allowed, so I could contain monomials. Furthermore, even when vr(xj) 7^ 0, the 
image of Xj could be At" for some A ^ 1. Thus, if x u 1— > Xt a and x v 1— > /it Q , then 
x u — ^x v G I: the binomial generators of I need not be pure differences. However, 
assuming I is prime, we are not free to assign the coefficients A and p, at will: 

x u - f x v G 7 x u+w - M x v+w G / for w G N n 

and x ru - f^x rv G 7 for r G N. 

The first line means that the coefficient on x v in x u — x v depends only on u — v, 
and the second means essentially that the assignment u — v 1— > fi/\ constitutes a 
homomorphism ker A — > k*. The precise statement requires a definition. 

Definition 1.23. A character on a sublattice L C Z" is a homomorphism p : L — > k*. 
If L C Z J for some subset J C {1, . . . , n}, then 

I P) j = I P + mj, 
where I p = (x u — p(u — v)x v | u — v G L) 
and mj = (xj | i G" J). 

Corollary 1.24. A binomial ideal I C k[x] with k algebraically closed is prime if and 
only if it is I p ^j for a character p : L — > k* defined on a saturated sublattice L C Z J . 

In other words, every prime binomial ideal in the polynomial ring k[x] over an 
algebraically closed field k is toric after forgetting some of the variables (those outside 
of J) and rescaling the rest (by the character p). 

Remark 1.25. Given any sublattice L C Z n , a character p is defined as a homomor- 
phism L — > k*. On the other hand, rescaling the variables Xj for j G J amounts to a 
homomorphism Z J — > k*. When L C Z J , there is usually no unique way to extend 
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p to a character Z J — > k* (there can be a unique way if k has positive characteris- 



tic). However, there is always at least one way when L is saturated- 
L Z n is split — because the natural map Hom(Z J ,k*) — > Hom(L, 



-so the inclusion 
[*) is surjective. 



Example 1.26. Let u 



G C be a primitive cube root of 1. If L C Z 4 C Z 5 



is spanned by the columns of the matrix B, below, and the character p takes the 
indicated values on these generators of L, then I Py j for J = {1, 2, 3, 4} is as indicated. 



B 



-1 
2 

-1 










-1 

2 
-1 




p : tu oj u) 



'p,{l,2,3,4} 



be — u 2 ad, b 2 — uac, c 2 — cobd, e). 



For instance, when u = (0, 1, 1, 0) and v = (1, 0, 0, 1), we get p(u — v) = u 2 . Compare 
this example to the 0123 case in Example 11.161 



2. Monomial ideals and primary binomial ideals 

The lattice-point geometry of binomial primary decompostion generalizes the geom- 
etry of monomial ideals. For primary binomial ideals, the connection is particularly 
clear. To highlight it, this section discusses what it looks like for a primary binomial 
ideal to have a monomial associated prime. The material in this section is developed 
in the context of an arbitrary affine semigroup ring, because that generality will be 
crucial in the applications to binomial ideals in polynomial rings k[x]. In return for the 
generality, there are no restrictive hypotheses on the characteristic or algebraic closure 
of k to contend with; except in Example 12.191 and Theorem I2.20[ k can be artbitrary. 

Definition 2.1. An ideal / C k[Q] in the monoid algebra of an affine semigroup Q 
over an arbitrary field is a monomial ideal if it is generated by monomials t a , and / 
is a binomial ideal if it is generated by binomials t a — At' 3 with a, (3 G Q and A G k. 

2.1. Monomial primary ideals. Given a monomial ideal, it is convenient to have 
terminology and notation for certain sets of monomials and lattice points. 

Definition 2.2. If / C k[Q] is a monomial ideal, then write std(J) for the set of 

exponent vectors on its standard monomials, meaning those outside of /. 
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Example 2.3. Here is a monomial ideal in k[Q] for Q = N 2 : 

z 



/ = (x 6 ,x 5 z,x 2 z 3 ,z 4 ) C k[x,z] < — > t * 

o o o o o 

o o o o o • — 

— o — o — o — o — o — o — • — ' — s~ x 

The reason for using z instead of y will become clear in Example 12.41 The bottom 
of the cross-hatched region is the staircase of /; its lower corners are the (lattice 
points corresponding to) the generators of /. The lattice points below the staircase 
correspond to the standard monomials of /. 

Example 2.4. The same monomial generators can result in a higher-dimensional 
picture if the ambient monoid is different. Consider the generators from Example 12.31 
but in k[Q] for Q = N 3 : 



z 



I = (x 6 , x 5 z, x 2 z 3 , z 4 ) C k[x, y, z] 




The "front face" of the picture — corresponding to the xz-plane — coincides with Ex- 
ample 12.31 The surface cross-hatched in thick lines is the staircase of /; its minimal 
elements again correspond to the generators of I, drawn as solid dots. Below the 
staircase sit the lattice points corresponding to the standard monomials of /. There 
are infinitely many standard monomials, but they occur along finitely many rays par- 
allel to the y- axis, each emanating from a point drawn as a bold hollow dot; these are 
the points in std(J) in the xz-plane. 

Example 2.5. In the square cone case, Example 11.44 let a , b, c, d be the genera- 
tors of the affine semigroup ring, as indicated in the figure there. Thus k[Q] = 
k[a, b, c, d}/ {ad — be). The monomial ideal (c, d) C k[Q] is prime; in fact, the compos- 
ite map k[a, b] k[Q] -» k[Q]/ (c, d) is an isomorphism. 



The phenomenon in Example 12.51 is general; the statement requires a definition. 
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Definition 2.6. A face F C Q of an affine semigroup Q C Z is a subset F = Q f] H 
obtained by intersecting Q with a halfspace -ff C M. d such that Q C H + is contained 
in one of the two closed halfspaces H + , H~ defined by H in ¥L d . 

Lemma 2.7. A monomial ideal I in an affine semigroup ring is prime std(J) is a 
face. The prime pp for F C Q induces an isomorphism k[F] ^ k[Q] -» k[Q]/p F . 

For a proof of the lemma, and lots of additional background on the connections 
between faces of cones and the algebra of affine semigroup rings, see |MS05[ §7.2]. 

Before jumping to the question of when an arbitrary binomial ideal is primary, let 
us first consider the monomial case. In a polynomial ring k[x], there is an elementary 
algebraic description as well as a satisfying geometric one. Both will be important 
in later sections, but it is the geometric description that generalizes most easily to 
arbitrary affine semigroup rings. 

Proposition 2.8. A monomial ideal I C k[x] is primary if and only if 

I = (x™ 1 , . . . , x^J , some other monomials in x^, . . . ,Xi r ). 

A monomial ideal I C k[Q] for an arbitrary affine semigroup Q C Z d is pp-primary 
for a face F C Q if and only if there are elements a\, . . . , ag G Q with 

i 

std(J) = \J{a k + ZF)nQ, 

k=l 

where "LF is the subgroup ofL d generated by F . 

Proof. The statement about k[x] is a standard exercise in commutative algebra. The 
statement about k[Q] is the special case of Theorem l2.23l below, in which the binomial 
ideal I is generated by monomials. □ 

What does a set of the form (a + W,F) D Q look like? Geometrically, it is roughly 
the (lattice points in the) intersection of an affine subspace with a cone. In the 
polynomial ring case, where Q = N n , a set (a + ZF) D Q is always + F for some 
lattice point /3 G N n : the intersection of a translate of a coordinate subspace with 
the nonnegative orthant is a translated orthant. In fact, F = N J = N n n Z J for some 
subset J C {1, . . . , n}, and then /3 is obtained from a by setting all coordinates from J 
to (if a has negative coordinates outside of J, then a + ZF fails to meet N n ). For 
general Q, on the other hand, (a + ZF) D Q need not be a translate of F. 

Example 2.9. The prime ideal in Example 12.51 corresponds to the face F consisting 
of the nonnegative integer combinations of t\ + e% and e$, where ei,e2,es are the 
standard basis of Z 3 . In terms of the depiction in Example II. 4[ these are the lattice 
points in Q that lie in the xz-plane. The subgroup ZF C Z 3 comprises all lattice 
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points in the xz-p\ane. Now suppose that a = the first lattice point along the 
2/-axis. Then 




1 




Z • {xzj-plane ) H Q 



< — > 




is a union of two translates of F. For reference, the square over which Q is the cone 
is drawn lightly, while dotted lines fill out part of the vertical plane a + 7LF . 

Proposition 2.10. Every monomial ideal I C k[Q] in an affine semigroup ring has a 
unique minimal primary decompostion I — PiH - • -nP r as an intersection of monomial 
primary ideals Pi with distinct associated primes. 

Proof sketch. I has a unique irredundant decompostion / = W\ D • • • D W s as an 
intersection of irreducible monomial ideals Wj. The existence of an irredundant ir- 
reducible decompostion can be proved the same way irreducible decompostions are 
produced for arbitrary submodules of noetherian modules. The uniqueness of such a 
decompostion, on the other hand, is special to monomial ideals in affine semigroup 
rings |MS05l Corollary 11.5]; it follows from the uniqueness of irreducible resolutions 
|Mil02l Theorem 2.4]. See |MS05l Chapter 11] for details. 

Given the uniqueness properties of minimal monomial irreducible decompostions, 
the (unique) monomial primary components are obtained by intersecting all irre- 
ducible components sharing a given associated prime. □ 

Remark 2.11. In polynomial rings, uniqueness of monomial irreducible decompos- 
tion occurs for approximately the same reason that monomial ideals have unique 
minimal monomial generating sets: the partial order on irreducible ideals is particu- 
larly simple |Mil09t Proposition 1.4]. See |MS05t §5.2] for an elementary derivation of 
existence and uniqueness of monomial irreducible decompostion by Alexander duality. 
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2.2. Congruences on monoids. The uniqueness of irreducible and primary decom- 
postion of monomial ideals rests, in large part, on the fine grading on k[Q], in which 
the nonzero components k[Q] a have dimension 1 as vector spaces over k. Similar 
gradings are available for quotients modulo binomial ideals, except that the gradings 
are by general noetherian commutative monoids, rather than by free abelian groups 
or by affine semigroups. Our source for commutative monoids is Gilmer's excellent 
book |Gil84j . For the special case of affine semigroups, by which we mean finitely 
generated submonoids of free abelian groups, see [MS051 Chapter 7]. 

For motivation, recall from Lemma 11.71 that the fibers of a monoid morphism from 
N n to Z d have nice structure, and that the polynomial ring k[x] becomes graded by Z d 
via such a morphism. The fibers are the equivalence classes in an equivalence relation, 
as is the case for any map 7r : Q — > Q' of sets; but when 7r is a morphism of monoids, 
the equivalence relation satisfies an extra condition. 

Definition 2.12. A congruence on a commutative monoid Q is an equivalence relation 
~ that is additively closed, in the sense that 

u ~ v =>- u + w ~ v + w for all w G Q. 

The quotient Q/~ is the set of equivalence classes under addition. 

Lemma 2.13. The quotient Q = Q/~ of a monoid by a congruence is a monoid. Any 
congruence ~ on Q induces a Q-grading on the monoid algebra k[Q] = ® ueQ k • t u 
in which the monomial t u has degree u G Q whenever u^u. 

Proof. This is an easy exercise. It uses that the multiplication on k[Q] is given by 

t u t v = t u+v for v g Q □ 

Definition 2.14. In any monoid algebra k[Q], a binomial ideal I C k[Q] generated by 
binomials t u — Xt v with A G k induces a congruence ~ (often denoted by ~/) in which 

u ~ v if t u - \t v G / for some A ^ 0. 

Lemma 2.15. Fix a binomial ideal I C k[Q] in a monoid algebra. Then I and k[Q}/I 
are both graded by Q = The Hilbert function Q i— > N ; which for any Q-graded 

vector space M takes q i— > dimt Mq, satisfies 



dim k (k[Q]//) ? 



if q = {u G Q | t u G 1} 
otherwise. 



The proof of the lemma is another simple exercise. To rephrase, it says that every 
pair of monomials in a given congruence class under ~/ are equivalent up to a nonzero 
scalar modulo /, and the only monomials mapping to are in I. A slightly less set- 
theoretic and more combinatorial way to think about congruences uses graphs. 
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Definition 2.16. Any binomial ideal / C k[Q] defines a graph Gi whose vertices are 
the elements of the monoid Q and whose (undirected) edges are the pairs (u,v) G QxQ 
such that t" — At' 1 ' G / for some nonzero A G k. Write n Q Gi for the set of connected 
components of Gi. 

Thus C G ttqGi is the same thing as a congruence class under ~/. The moral 
of the story is that combinatorics of the graph Gj controls the (binomial) primary 
decompostion of I. 

Example 2.17. Each of the two binomial generators of the ideal 



I = (x 2 — xy, xy — y 2 ) C k[x, y] 




determines a collection of edges of the graph Gj, indicated in the figure, by additivity 
of the congruence ~j. In reality, Gj has many more edges than those depicted: 
since ~/ is an equivalence relation, every connected component is a complete graph 
on its vertex set. However, in examples, it is convenient to draw — and more helpful 
to see — only edges determined by monomial multiples of generating binomials. 

The connected components of Gj are the fibers of the monoid morphism from N 2 to 

N 2 /~/ = ' ' 

the monoid N with the element 1 "doubled" . The ideal / has primary decomposition 

I = (x 2 ,xy,y 2 ) (l(x-y). 

The first primary component reflects the three singleton components of Gj near the 
origin. The other primary component reflects the diagonal connected components 
of Gi marching off to infinity. 

Although the N-graded Hilbert function of k[x, y]/I takes the values 1, 2, 1, 1, 1, . . ., 
the N 2 -graded Hilbert function takes only the value 1. 

Example 2.18. When / = I a is the toric ideal for a matrix A, the connected com- 
ponents of Gj are the fibers F a for a G NA. 

Example 2.19. If / = I p j is a binomial prime in a polynomial ring k[x] over an 
algebraically closed field k, and C G vroG/ is a connected component, then either 
C = N™ \ N J or else C = (u + L) PI N J for some u G N J . When p is the trivial (only) 
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character on the lattice L = {0} C and J = {3} C {1, 2, 3}, for instance, then 



In this case N n \ N J consists of the monomials off of the vertical axis (i.e., those in 
the region outlined by bold straight lines), whereas every component (u + L) D N J 
of Gi is simply a single lattice point on the vertical axis. 

2.3. Binomial primary ideals with monomial associated primes. The alge- 
braic characterization of monomial primary ideals in the first half of Proposition 12.81 
has an approximate analogue for primary binomial ideals, although it requires hy- 
potheses on the base field k. 

Theorem 2.20. Fix k algebraically closed of characteristic 0. If I C k[x] is an I p j- 
primary binomial ideal, then I = I p + B for some binomial ideal B D xxij with £ > 0. 

Proof. This is the characterization of primary decomposition [ES96| Theorem 7.1] 
applied to a binomial ideal that is already primary. □ 

The content of the theorem is that / contains both I p and a power of each variable 
Xi for i J. (In positive characteristic, I contains a Frobenius power of I p , but not 
necessarily I p itself.) A more precise analogue of the algebraic part of Proposition 12.81 
would characterize which binomial ideals B result in primary ideals. However, there 
is no simple way to describe such binomial ideals B in terms of generators. The best 
that can be hoped for is an answer to a pair of questions: 

1. What analogue of standard monomials allows us to ascertain when I is primary? 

2. What property of the standard monomials characterizes the primary condition? 

Preferably the answers should be geometric, and suitable for binomial ideals in arbi- 
trary affine semigroup rings, as in the second half of Proposition 12.81 

Lemma 12.151 answers the first question. Indeed, if I C k[Q] is a monomial ideal, 
then std(J) is exactly the subset of Q such that k[Q]/7 has Q-graded Hilbert function 
dim^ (k[Q]/I) u = 1 for u G std(J) and for t u G /. In the monomial case, we could 
still define the monoid quotient Q = Q/~i, whose classes are all singleton monomials 
except for the class of monomials in I. Therefore, for general binomial ideals J, the 
set of non-monomial classes of the congruence ~/ plays the role of std(7). 




X 



THEORY AND APPLICATIONS OF LATTICE POINT METHODS FOR BINOMIAL IDEALS 17 



Note that Proposition 12.81 answers the second question for monomial ideals in affine 
semigroup rings, whose associated primes are automatically monomial. The next step 
relaxes the condition on / but not on the associated prime: consider a binomial ideal 
/ in an affine semigroup ring k[Q], and ask when it is pir-primary for a face F C Q. 
The answer in this case relies, as promised, on the combinatorics of the graph Gi from 
Definition 12. 161 and its set ttqGi of connected components; however, as Proposition 12.81 
hints, the group generated by F enters in an essential way. 

Definition 2.21. For a face F of an affine semigroup Q, and any k[Q]-module M, 

M[ZF] = M (g) k[Q] k[Q + IF] 

is the localization by inverting all monomials not in pp. If I C k[Q] is a binomial 
ideal, then a connected component C G 7r G/ is F -finite if C — C D Q for some finite 
connected component C of the graph GWfi for the localization l\LF\ C k[Q][ZF]. 

Thus, for example, k[Q][ZF] = k[Q + ZF] is the monoid algebra for the affine 
semigroup Q + ZF obtained by inverting the elements of F in Q. In Proposition 12.81 
where / C k[Q] is a monomial ideal, all of the connected components of Gj inside 
of std(J) are singletons, and the same is true of Gj[zf\. See Section [37T1 for additional 
information and examples concerning the geometry and combinatorics of localization. 

Example 2.22. Connected components of Gi can be finite but not F-finite for a 
given face F of Q. For instance, if / = (x — y) C k[x, y] = k[N 2 ], then the connected 
components of Gj are all finite — they correspond to the sets of monomials in k[x, y] 
of fixed total degree — but not F- finite if F is the horizontal axis of Q = N 2 : once x is 
inverted, ttqGi[ x -ij consists of infinite northwest-pointing rays in the upper half-plane. 

Theorem 2.23. Fix a monomial prime ideal pp = (t u \u ^ F) in an affine semigroup 
ring k[Q] for a face F C Q. A binomial ideal I C k[Q] is pp-primary if and only if 

1. Every connected component of Gi other than {u e Q \ t u G /} is F -finite. 

2. F acts on the set of F -finite components semifreely with finitely many orbits. 

Thus the primary condition is fundamentally a finiteness condition — or really a 
pair of finiteness conditions. A proof is sketched after Example I2.26( but first, the 
terminology requires precise explanations. Semifreeness, for example, guarantees that 
the set of F-finite components is a subset of a set acted on freely by ZF; this is part 
of the characterization of semifree actions in |KM10] . 

Definition 2.24. An action of a monoid F on a set T is a map F x T — > T, 
written (/, t) y f + t, that satisfies + t — t for all t 6 T and respects addition: 
(/ + g) + t = f + (g + t). The monoid action is semifree if t h- > f + t is an injection 
T ^-t- T for each / e F, and / h > f + 1 is an injection F ^ T for each t G T. 

In contrast to group actions, monoid actions do not a priori define equivalence rela- 
tions, because the relation t ~ / + t can fail to be symmetric. The relation is already 
reflexive and transitive, however, precisely by the two axioms for monoid actions. 



18 



EZRA MILLER 



Definition 2.25. An orbit of a monoid action of F on T is an equivalence class under 
the symmetrization of the relation {(s, t) \ f + s = t for some / G F} CTxT. 

Combinatorially, if F acts on T, one can construct a directed graph with vertex set 
T and an edge from s to t iit = f + s for some / G F . Then an orbit is a connected 
component of the underlying undirected graph. 

Example 2.26. The ideal 



is p-primary for p = pp = (x,y), where F is the z-axis of N 3 . The monoid F acts on the 
F-finite connected components of Gj with two orbits: one on the z-axis, where each 
connected component is a singleton; and one adjacent orbit, where every connected 
component is a pair. The monomial class in this example (outlined by bold straight 
lines) is the set of monomials in (x 2 ,xy,y 2 ). 

Example 2.27. Fix notation as in Examples 12.51 and 1 2 . 9 1 The ideal / = (c 2 , cd, d 2 ) C 
k[Q] is pir-primary. In contrast to Example 12.261 this time the F-finite connected 
components are all singletons, but the two orbits are not isomorphic as sets acted on 
by the face F: one orbit is F itself, while the other is the set depicted in Example 12.91 

Proof sketch for Theorem \2. 23i This theorem is the core conclusion of |DMM10l The- 
orem 2.15 and Proposition 2.13]. The argument is summarized as follows. 

For any set T, let k{T} denote the vector space over k with basis T. If tiqGi 
satisfies the two conditions, then k[Q]/F has finite a filtration, as a k[Q]-module, whose 
associated graded pieces are the vector spaces k{T} for the finitely many F-orbits T of 
F-finite components of Gi. In fact, semifreeness guarantees that for each orbit T, the 
vector space k{T} is naturally a torsion-free module over k[F] = k[Q]/pi?. Finiteness 
of the number of orbits guarantees that the associated graded module of k[Q]/J is a 
finite direct sum of modules k{T}, so it has only one associated prime, namely pi?. 
Consequently k[Q]/I itself has just one associated prime. 

For the other direction, when I is p ir-primary, one proves that inverting the mono- 
mials and binomials outside of p^ annihilates the Q-graded pieces of k[Q]/J for which 
the connected component in GWfi is infinite |DMM10j Lemmas 2.9 and 2.10]. Since 
the elements outside of p^ act injectively on k[Q]/J by definition of p^-primary, every 
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class of ~/ that is not F-finite must therefore already consist of monomials in I. The 
semifree action of F on the F-finite components derives simply from the fact that 
k[Q]/I is torsion-free as a Ik [F] -module, where the k[F]-action is induced by the in- 
clusion k[F] C k[Q]. Generalities about Q-gradings of this sort imply that k[Q]/I 
possesses a filtration whose associated graded pieces are as in the previous paragraph. 
The minimality of over / implies that the length of the filtration is finite. □ 

3. Binomial primary decomposition 

General binomial ideals induce more complicated congruences than primary bino- 
mial ideals. This section completes the combinatorial analysis of binomial primary 
decomposition by describing how to pass from an arbitrary binomial ideal to its pri- 
mary components. There are crucial points where characteristic zero or algebraically 
closed hypotheses are required of the field k, but those will be mentioned explicitly; 
if no mention is made, then k is assumed to be arbitrary. 

3.1. Monomial primes minimal over binomial ideals. The first step is to con- 
sider again the setting from the previous section, particularly Theorem I2.23[ where a 
monomial prime ideal pp is associated to / in an arbitrary affine semigroup ring k[Q], 
except that now the binomial ideal / is not assumed to be primary. The point is to 
construct its pi?-primary component. The nature of Theorem 12.231 splits the construc- 
tion into two parts: 

1. ensuring that the (non-monomial) connected components are all F-finite, and 

2. forcing the face F to act in the correct manner on the components. 

These operations will be carried out in reverse order, with part 2 being accomplished 
by localization, and then part 1 being accomplished by simply lumping all of the 
connected components that are not F-finite together. 

Definition 3.1. For a face F of an affine semigroup Q and a binomial ideal / C k[Q], 

(I : t F ) = I[ZF] nk[Q] 

is the kernel of the composite map k[Q] — > k[Q]/I — > (k[Q]/I)[ZF]. 

Remark 3.2. The notation (/ : t F ) is explained by an equivalent construction of this 
ideal. Indeed, the usual meaning of the colon operation for an element y G k[Q] is that 
(I : y) — {z G k[Q] \ yz G /}. Here, (J : t F ) = (I : t^) for any lattice point / lying 
sufficiently far in the relative interior of F. Equivalently, (/ : t F ) = (I : (v)°°) = 
[J rgN (J : t r ^) for any lattice point / G F that does not lie on a proper subface of F. 

Combinatorially, the passage from / to (J : t F ) has a concrete effect. 

Lemma 3.3. The connected components of the graph Grj. t F^ defined by (I : t F ) are 
obtained from n Gi by joining together all pairs of components C u and C v such that 
C u+ f = C v+ f for some f G F , where C u is the connected component containing u G Q. 
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Proof. Two lattice points u,v G Q lie in the same component of Gn. t F) precisely when 
there is a binomial t u - \t v G k[Q] such that t f (t u - \t v ) G I for some f E F. □ 

Roughly speaking: join the components if they become joined after moving them 
up by an element / G F. Illustrations of lattice point phenomena related to binomial 
primary decomposition become increasingly difficult to draw in two dimensions as the 
full nature of the theory develops, but a small example is possible at this stage. 

Example 3.4. The ideal / = (xz — yz, x 2 — x 3 ) C k[x, y, z] yields the same graph as 
Example 12.41 except for two important differences: 

• here there are many fewer edges in the xy-plane (z = 0); and 

• every horizontal (z = constant > 1) slice of the big region is a separate con- 
nected component, in contrast to Example 12.41 where the entire big region was 
a single connected component corresponding to the monomials in the ideal. 



(xz — yz, x 2 — x 3 ) C k[x, y, z] 



Gi 




Only the second generator, x 2 — x 3 , is capable of joining pairs of points in the xy- 
plane, and it does so parallel to the x-axis, starting at x = 2. Of course, x 2 — x 3 also 
joins pairs of points in the same manner at positive heights z > 1, but again only 
starting at x = 2. The first generator, xz — yz, has the same effect as x — y did in 
Example 12. 4[ except that xz — yz only joins pairs of lattice points at height z = 1 or 
more. In summary, every connected component of Gi in this example is contained in 
a single horizontal slice, and the horizontal slices of Gi are 



z = and z > 1. 

The outline of the big region is drawn as a dotted line in the z > 1 slice illustration, 
which depicts only enough of the edges to elucidate its three connected components. 
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Let F be the part of N 3 in the iq/-plane, so pF = (z). The ideal (7 : t F ) = (7 : z°°) = 
(7 : z) = (x — y, x 2 — x 3 ) again has the property that every connected component 
of Cr(j :je ) is contained in a single horizontal slice, but now all of these slices look like 
the z > 1 slices of Gj. Compare this to the statement of Lemma [3. 3 1 

Theorem 3.5. Fix a monomial prime pF = (t u \ u ^ F) in an affine semigroup ring 
k[Q] for a face F C Q. If pp is minimal over a binomial ideal I C k[Q] andW Gi[zF] is 
the set of finite components of the graph Gnzm, then the pp-primary component of I is 

(I:t F ) + (t u \C u ^WoG I[ZF ]). 

The exponents on the monomials in this primary component are precisely the elements 
of Q that lie in infinite connected components of the graph Gj[zf] ■ 

Proof. This is |DMM10| Theorem 2.15]. The p^-primary component of 7 is equal to 
the p^-primary component of (7 : t F ) because primary decomposition is preserved 
by localization (see |AM69t Proposition 4.9], for example), so we may as well assume 
that I = (7 : t F ). It is elementary to check that F acts on the connected components. 
The action is semifree on the F-finite components, for ii f + u ~ g + u for some 
f,g e F, then u ~ n(f — g) + u for all n G N, whence C u G hqGi^f} is infinite; and 
if/ + w~/ + w then u ~ v, because x-^ is a unit on I\LF\. It is also elementary, 
though nontrivial, to check that the kernel of the usual localization homomorphism 
k[Q]/I — > k[Q] PF /I PF — inverting all polynomials outside of pp, not just monomials- 
contains every monomial t u for which C u ^ 7fo(j/[zF] |DMM10l Lemmas 2.9 and 2.10]. 
Now note that (/ : t F ) + (t u \ C u ttoGi[zf]) is already primary by Theorem 12.231 □ 

Remark 3.6. The graph of I\LF\ has vertex set Q[Z,F], which naturally contains Q. 
That is why, in Definition 12. 21 1 and Theorem 13. 5 [ it makes sense to say that an element 
of Q lies in a connected component of G/[zf]- 

Example 3.7. The p^-primary component of the ideal I in Example 13.41 is the ideal / 
in Example 12.41 the differences that remain, after the localization operation in Exam- 
ple E31 is complete, are erased by lumping together the infinite connected components 
into a single monomial component. 

Example 3.8. Starting with (/ : t F ) in Theorem l3.5l it is not enough to throw in the 
monomials whose exponents lie in infinite components of Grj. t Fy, that is, a connected 
component of Grj. t F) could be finite but nonetheless equal to the intersection with Q 
of an infinite component of G/[zf]- This occurs for / = (xz — yz) C k[x, y, z], with F 
being the rcy-coordinate plane of Q = N 3 , so pp — (z). Every connected component 
of Gi is finite, even though 7 = (J : t F ) is not primary. When x and y are inverted 
to form I[7jF], the components at height z > 1 become cosets of the line spanned 
by [_J, whose intersections with N 3 are bounded. Hence the (^-primary component 
of 7 is 7 + (z) = (z), as is clear from the primary decomposition 7 = (z) D (x — y). 
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3.2. Primary components for arbitrary given associated primes. For this sub- 
section, fix a binomial ideal / C k[x] in a polynomial ring with a binomial associated 
prime I Pj j for some character p : L — > k* defined on a saturated sublattice L C Z J . 
Now it is important to assume that the field k is algebraically closed of characteris- 
tic 0, for these hypotheses are crucial to the truth of Theorem 12.201 an d that theorem 
is the tool that reduces the current general situation to the special case in Section [37T1 
The logic is as follows. 

Every binomial J Pj j-primary ideal contains I p by Theorem 12.201 Since we are trying 
to construct a binomial J Pi j-primary component of / starting from I itself, the first 
step should therefore be to enlarge I by throwing in I p . Here is a formal statement. 

Proposition 3.9. Fix a binomial ideal I C k[x] with k algebraically closed of charac- 
teristic 0. If P is any binomial I p j-primary component of I, then P is the preimage 
in k[x] of a binomial (I Pt j/I p )-primary component of (I + I p )/I p C k[x]// p . 

An alternative phrasing makes the point of considering the quotient k[x]// p clearer. 

Proposition 3.10. If P is an I Pt j-primary binomial ideal in k[x] ; with k algebraically 
closed of characteristic 0, then the image of P in the affine semigroup ring k[Q] = 
k[x]// p is a binomial ideal pF-primary to the monomial prime pF = I P: j/I p in k[Q]. 

Proof. This is an immediate consequence of Theorem 12.201 and Theorem 11.221 along 
with Corollary 11.241 the affine semigroup Q is (N J /L) x N J , and the face F is the 



Thus the algebra of general binomial associated primes for polynomial rings is lifted 
from the algebra of monomial associated primes in affine semigroup rings. The final 
step is isolating how the combinatorics, namely Theorem 12.231 lifts. Since the algebra 
of quotienting k[x] modulo I p corresponds to the quotient of N n modulo L, we expect 
the lifted finiteness conditions to involve cosets of L. 

Definition 3.11. A subset of N n is L-bounded for a sublattice L C Z n if the subset 
is contained in a finite union of cosets of L. 

Corollary 3.12. Fix a binomial ideal I C k[x] with k algebraically closed of charac- 
teristic 0. If I p ,j is minimal over I , then the I p j-primary component of I is 



If I Pt j is associated to I but not minimal over I , then for any monomial ideal K 
containing a sufficiently high power of rrij = (xj | i ^ J) , an I Py j -primary component 
of I is defined as P is, above, but using I' K in place of V , where 



copy of N J /L = (N J /L) x {0} in Q. 



□ 




Xn, to be 



r K =((I + I p + K):x?). 
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Proof sketch. This is |DMM10l Theorem 3.2]. The key is to lift the monomial minimal 
prime case for affine semigroup rings in Theorem l3.5l to the current binomial associated 
prime case in polynomial rings using Propositions 13.91 and 13.101 For an embedded 
prime J Pi j, one notes that any given J p j-primary component of / must contain a 
sufficiently high power of mj, so it is logical to begin the search for an J p j-primary 
component by simply throwing such monomials along with I p into I. But then I Pi j is 
minimal over the resulting ideal I + I p + K, so the minimal prime case applies. □ 

The definition of P in the theorem says that Gp has two types of connected com- 
ponents: the ones that are L-bounded upon localization along Z J , and the connected 
component consisting of exponents on monomials in P. The theorem says that Gp 
shares all but its monomial component with the graph Gji, and that the other con- 
nected components of Gp fail to remain L-bounded upon localization along Z J . In 
the case where I p> j is an embedded prime, the graph-theoretic explanation is that 
Gp has too many connected components that remain L-bounded upon localization; 
in fact, there are infinitely many N J -orbits. The hack of adding K throws all but 
finitely many N J -orbits into the L- infinite "big monomial" connected component. 

Example 3.13. A primary decomposition of the ideal I = (x 2 — xy,xy — y 2 ) was 
already given in Example 12.171 Analyzing it from the perspective of Corollary 13.121 
completes the heuristic insight. 

First let I p j = (x — y), so J = {1,2} and p : L — > k* is trivial on the lattice L 
generated by [_J. Then I' — 1+ (x — y) is already prime. Passing from / to /' has the 
sole effect of joining the two isolated points (the basis vectors) on the axes together. 

Now let I Py j = (x,y), so J = and p is the trivial (only) character defined on 
L = Z J = {0}. Every connected component of Gj in Example 12.171 remains L- 
bounded upon localization along Z J , but there are infinitely many such components. 
Choosing K = (x, y) e , so that V K — V + (x, y) e , kills off all but finitely many, to get 



In particular, taking e = 2 recovers the primary decomposition from Example 12.171 

3.3. Finding associated primes combinatorially. The constructions of binomial 
primary components in previous sections assume that a monomial or binomial associ- 
ated prime of a binomial ideal has been given. To conclude the discussion of primary 



y 



P = (x 2 — xy, xy — y 2 ) + (x, y) e C k[x, y] 
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decomposition of binomial ideals, it remains to examine the set of associated primes. 
The existence of binomial primary decompositions hinges on a fundamental result, 
due to Eisenbud and Sturmfels [ES96[ Theorem 6.1], that was a starting point for all 
investigations involving primary decomposition of binomial ideals. 

Theorem 3.14. Every associated prime of a binomial ideal in k[x] is a binomial 
prime if the field k is algebraically closed. 

Although the statement is for polynomial rings, a simple reduction implies the 
existence of binomial primary decomposition in the generality of monoid algebras as 
defined in Section I2.2[ given the construction of binomial primary components. 

Corollary 3.15. Fix a finitely generated commutative monoid Q and an algebraically 
closed field k of characteristic 0. Every binomial ideal I in k[Q] admits a binomial 
primary decomposition: I — Pi H ■ • • H P r for binomial ideals Pi, . . . , P r . 

Proof. Choose a presentation N n -» Q. The kernel of the induced presentation k[x] -» 
k[Q] is a binomial ideal in k[x]. Therefore the preimage of I in k[x] is a binomial 
ideal I'. The image in k[Q] of any binomial primary decomposition of I' is a binomial 
primary decomposition of /. Therefore it suffices to prove the case where Q = N n 
and I' = I. Since every associated prime of / is binomial by Theorem 13. 14} the result 
follows from Corollary 13. 121 □ 

Corollary 13.151 is stated only for characteristic to demonstrate the connection 
between prior results in this survey. However, the restriction is unnecessary. 

Theorem 3.16. Corollary \3.15\ holds for fields of positive characteristic, as well. 

Proof. For polynomial rings this is |ES96[ Theorem 7.1], and the case of general 
monoids Q follows by the argument in the proof of Corollary 13.151 □ 

What's missing in the positive characteristic case is combinatorics of primary ideals. 

Open Problem 3.17. Characterize primary binomial ideals and primary components 
of binomial ideals combinatorially in positive characteristic. 

Note, however, that a solution to this problem would still not say how to discover — 
from the combinatorics — which primes are associated. The same is true in character- 
istic 0. Thus Corollary 13. 121 is unsatisfactory for two reasons: 

• it requires strong hypotheses on the field k; and 

• it assumes we know which primes J Pi j are associated to /. 

Fortunately, there is a combinatorial, lattice-point method to recognize associated 
primes — or at least, to reduce the recognition to a finite problem. The main point 
is Theorem 13.261 : the combinatorics of the graph Gi can be used to construct a 
decomposition of / as an intersection of "primary-like" binomial ideals in a manner 
requiring no hypotheses on the characteristic or algebraic closure of the base field. 
The statement employs some additional concepts. 
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Definition 3.18. A subset of Z n is J -bounded if it intersects only finitely many cosets 
of Z J in Z n . 

Lemma 3.19. If I C k[x] is a binomial ideal, then Z J acts on the set of J -bounded 
components of the graph Gjnj-i on Z J x N J induced by the localization I[Z J ] along N J . 

Proof. In fact, Z J acts on all of the connected components, because the Laurent 
monomials x u for «6Z are units modulo I[Z J ] = I[ZF] for the face F = N J . □ 

Definition 3.20. A witness for a sublattice ICZ J potentially associated to / is any 
element in a J-bounded connected component of Gmj-i whose stabilizer is L. 

Example 3.21. The binomial ideal 

y 

A 



I = (y- x 2 y, y 2 - xy 2 , y 3 } C k[x, y] 




induces the depicted congruence. Its potentially associated lattices are all contained 
in Z = ZW, parallel to the x-axis. The lattices are generated by 0, by [J, and by [J. 

The subset J is part of the definition of potentially associated sublattice; it is not 
enough to specify L alone. The notion of associated lattice, without the adverb "po- 
tentially" , would require further discussion of primary decomposition of congruences 
on monoids; see the definition of associated lattice in |KM10j . That said, the set of po- 
tentially associated lattices, which contains the set of associated ones, suffices for the 
purposes here, although sharper results could be stated with the more precise notion. 

Proposition 3.22. Every binomial ideal I C k[x] has finitely many potentially asso- 
ciated lattices L C Z J . If K — (J : x") C k[x] is the annihilator of x. u in k[x]/J for a 
witness a G N", then K[Z J ] + m.j = I at j[Z J ] for a uniquely determined witness char- 
acter a : L — > k*. Given I, each L C Z J determines finitely many witness characters. 

Proof. This is proved in |KM10j on the way to the existence theorem for combinato- 
rial mesoprimary decomposition. The finiteness of the set of potentially associated 
lattices traces back to the noetherian property for congruences on finitely generated 
commutative monoids. The conclusion concerning K is little more than the charac- 
terization of binomial ideals in Laurent polynomial rings |ES96t Theorem 2.1]. The 
finiteness of the number of witness characters occurs because witnesses for L C Z J 
with distinct witness characters are forced to be incomparable in N n . □ 



26 



EZRA MILLER 



In Proposition 13.221 the domain L of the character a appearing in J CT) j need not be 
saturated (see Definition 11.231) . and no hypotheses are required on the field k. 

Deducing combinatorial statements about associated primes or primary decompo- 
sitions of binomial ideals is often most easily accomplished by reducing to the case of 
ideals with the simplest possible structure in this regard. 

Definition 3.23. A binomial ideal with a unique potentially associated lattice is 
called mesoprimary. A mesoprimary decomposition of a binomial ideal J C |[x] is an 
expression of / as an intersection of finitely many mesoprimary binomial ideals. 

Example 3.24. Primary binomial ideals in polynomial rings over algebraically closed 
fields of characteristic are mesoprimary. That is basically the content of Theo- 
rem 12.231 for such fields, given Proposition 13.101 More precisely, the combinatorics of 
mesoprimary ideals is just like that of primary ideals, except that instead of an affine 
semigroup acting semifreely, an arbitrary finitely generated cancellative monoid acts 
semifreely; see the characterizations of mesoprimary congruences in |KM10] . 

Definition 13.231 only stipulates constancy of the combinatorics, not the arithmetic — 
meaning the witness characters — but the arithmetic constancy is automatic. 

Lemma 3.25. If I is a mesoprimary ideal, then the witnesses for the unique poten- 
tially associated lattice all share the same witness character. 

Proof. This follows from the same witness incomparability that appeared in the proof 
of Proposition 13.221 The statement is equivalent to one direction of the characteriza- 
tion of mesoprimary binomial ideals as those with precisely one associated mesoprime; 
see |KM10] , where a complete proof can be found. □ 

Theorem 3.26. Every binomial ideal I C k[x] admits a mesoprimary decomposition 
in which the unique associated lattice and witness character of each mesoprimary 
component is potentially associated to I. 

Proof. This is a weakened form of the existence theorem for combinatorial mesopri- 
mary decomposition in |KM10j . □ 

The power of Theorem 13.261 lies in the crucial conceit that the combinatorics of the 
graph Gi controls everything, so the lattices associated to the mesoprimary compo- 
nents are severely restricted. Over an algebraically closed field of characteristic 0, for 
instance, every primary decomposition is a mesoprimary decomposition, but usually 
the lattices are not associated to /. This is the case for a lattice ideal II, as long as 
the lattice L is not saturated: the ideal II is already mesoprimary, but the associated 
lattice of every associated prime is the saturation L sat = (L ®i Q) D Z n , the smallest 
saturated sublattice of Z n containing L. In general, the combinatorial control is what 
allows Theorem 13.261 to be devoid of hypotheses on the field. 
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As in any expression of an ideal I as an intersection of larger ideals, information 
about associated primes of / can just as well be read off of the intersectands. For 
mesoprimary decompositions this is especially effective because primary decomposi- 
tion of mesoprimary ideals |KM10] is essentially as simple as that of lattice ideals 
|ES96l Corollary 2.5]. In particular, when the field is algebraically closed, potentially 
associated lattices yield associated primes by way of saturation. The point is that 
only finitely many characters L sat — > k* restrict to a given fixed character L — > k* . In 
fact, when k is algebraically closed, these characters are in bijection with the finite 
group Hom(L sat /L, k*). Thus Theorem 13.261 reduces the search for associated primes 
of / to the combinatorics of the graph Gj, along with a minimal amount of arithmetic. 

Corollary 3.27. // the field k is algebraically closed, then every associated prime of 
I C k[x] is I p j for some character p : L sat — > k* whose restriction to L is one of the 
finitely many witness characters defined on a potentially associated lattice L C Z J of I . 

Proof. Every associated prime of / is associated to a mesoprime in the decomposition 
from Theorem 13.261 Now apply either the primary decomposition of mesoprimary 
ideals [KMlOj or the witness theorem for cellular ideals |ES96[ Theorem 8.1], using 
the fact that mesoprimary ideals are cellular [KM10] . □ 

Remark 3.28. In the special case where / is cellular, meaning that every variable is 
either nilpotent or a nonzerodivisor modulo /, Corollary 13.271 coincides with [ES96t 
Theorem 8.1]. Every binomial ideal in any polynomial ring over any field is an inter- 
section of cellular ideals |ES96t Theorem 6.2], with at most one cellular component 
for each subset J C {1, . . . , n}, so it suffices for many purposes to understand the 
combinatorics of cellular ideals. (Theorem 13.261 strengthens this approach, since meso- 
primary ideals are cellular and their combinatorics is substantially simpler.) The way 
witnesses and witness characters are defined above, however, it is not quite obvious 
that the information extracted from witnesses for the original ideal / and those for its 
cellular components coincides. That this is indeed the case constitutes a key ingre- 
dient proved in preparation for the existence theorem for combinatorial mesoprimary 
decomposition in |KM10j . 

Exercise 3.29. The ideal / = (xz — yz, x 2 — x 3 ) C k[x, y, z] from Example 13.41 has 
primary decomposition / = (x — y, x 2 , xy, y 2 ) R (x — 1, y — 1) R (x 2 , z) R (x — 1, z). The 
reader is invited to find all associated lattices L C Z J of / and match them to the 
associated primes of /. Then the reader can verify, using Corollary 13. 12} that the given 
primary decomposition of / really is one. Hint: take J e {{3}, {1, 2, 3}, {2}, {1, 2}}. 

The upshot of Sections [TH5] is that the lattice-point combinatorics of congruences on 
monoids lifts to combinatorics of monomial and binomial primary and mesoprimary 
decompositions of binomial ideals in monoid algebras. From there, binomial primary 
decomposition is a small arithmetic step, having to do with group characters for 
finitely generated abelian groups. 
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Part 2. Applications 

4. Hypergeometric series 

The idea for lattice-point methods in binomial primary decomposition originated in 
the study of hypergeometric systems of differential equations, particularly their series 
solutions. The literature on these systems and series is so vast — owing to its con- 
nections with physics, numerical analysis, combinatorics, probability, number theory, 
complex analysis, and algebraic geometry — that one section in a survey lacks the abil- 
ity to lend proper perspective. Therefore, the goal of this section is to make a beeline 
for the connections to binomial primary decomposition, with just enough background 
along the way to allow the motivations and conclusions to shine through. Much of 
the exposition is borrowed from |DMM07l IDMM10 7 ] . sometimes nearly verbatim. The 
extended abstract [DMM07] presents a broader, more complete historical overview. 

4.1. Binomial Horn systems. Horn systems are certain sets of linear partial differ- 
ential equations with polynomial coefficients. Their development grew out of the ordi- 
nary univariate hypergeometric theory going back to Gauss (see |SK85j . for example) 
and Kummer |Kuml836] , through the bivariate versions of Appell, Horn, and Mellin 
| Appl880 IHorl889| IHor3T| IMel2l] . These formulations had no apparent connection to 



binomials, but through a relatively simple change of variables, Gelfand, Graev, Kapra- 
nov, and Zelevinsky brought binomials naturally into the picture |GGZ87| fGKZ89j. 

The data required to write down a binomial Horn system consist of a basis for a 
sublattice L C Z n and a homomorphism (3 : II 1 ' jL — > C. Focus first on the basis, 
which is traditionally arranged in an integer n x m matrix B, where m = rank(L). If 
b G Z n is a column of B, then b determines a binomial: 

b g Z" ~* d h+ - d h - e C[d) = C[d u ...,d n ), 

where b = b + — b_ expresses the vector b as a difference of nonnegative vectors with 
disjoint support. Elements of the polynomial ring C[d] are to be viewed as differential 
operators on functions C n — > C. Therefore the matrix B determines a system of 
m binomial differential operators, one for each column. The interest is a priori in 
solutions to differential systems, not really the systems themselves, so it is just as well 

1(B) = (d h + -d h - | b = b+ - b_ is a column of B) C C[d] 

generated by these binomials, because any function annihilated by the m binomials 
is annihilated by all of I{B). 

Example 4.1. In the 0123 situation from Examples 11.21 and 11.161 using variables 
d — d\, d 2 , d 3 , <9 4 instead of a, b, c, d or x±, x 2 , x 3 , x 4 yields 1(B) = (did^ — d^, d 2 d^ — <9f ). 

Example 4.2. In the H°° situation from Examples 11.31 and 11.171 with d variables, 
1(B) = [d l d 3 -d 2 ,d x d A -d 2 ). 
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The set of homomorphisms Z n /L — > C is a complex vector space Hom(Z n /L, C) of 
dimension d := n — m. Choosing a basis for this vector space is the same as choosing 
a basis for (Z n /L) ®zC, which is the same as choosing a d x n matrix A with AB = 0. 
Let us now, once and for all, fix such a matrix A with entries for % — 1, . . . , d and 
j = 1, . . . ,n. The situation is therefore just as it was in Examples 11.161 and 11.17^ and 
our homomorphism Z n /L — > C becomes identified with a complex vector (3 G C d . 
Together, A and (3 determine d differential operators E\ — /3i, . . . , Ed — (3d, where 

Ei = anX\di + • • • + cii n x n d n . 

Note that ciijXjdj is the operator on functions f(xi, . . . , x n ) : C n — >• C that takes the 
partial derivative with respect to Xj and multiplies the resulting function by a^Xj. 

Definition 4.3. The binomial Horn system H(B,(3) is the system 

1(B) f = 
Eif = fof 



E d f = M 

of differential equations on functions /(x) : C n — > C determined by the lattice basis 
ideal 1(B) and the Euler operators E\ — /3i, . . . , Ed — ^d- 

The goal is to find, characterize, or otherwise understand the solutions to H(B,j3). 
Example 4.4. In the 0123 case from Example 11.161 H(B,/3) has lattice basis part 

1(B) f = & (d 1 d 3 - d 2 2 )f = 
and (d 2 d 4 - 8$)f = 
and the Euler operators yield the following equations: 

(xi<9i + x 2 d 2 + x 3 d 3 + x 4 di)f = f} x f 
( x 2 d 2 + 2x 3 <9 3 + 3x 4 d 4 )f = /3 2 f. 

Example 4.5. In the case from Example 14.21 H(B,f3) has lattice basis part 

1(B) f = (d 1 d 3 - 2 )f = 
and (<9i«9 4 - d 2 )f = 
and the Euler operators yield the following equations: 

(x!d!+x 2 d 2 )f = f3 l f 

( x 2 d 2 + x 3 d 3 + x 4 d A )f = (3 2 f. 
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Since Horn systems are linear, their solution spaces are complex vector spaces. 
More precisely, the term solution space in what follows means the vector space of 
local holomorphic solutions defined in a neighborhood of a (fixed, but arbitrary) point 
in C n that is nonsingular for the Horn system. 

Example 4.6. In the 0123 case from Example 14.41 for any parameter vector ft the 
Puiseux monomial / = X-y IS Si solution of H(B, ff). Indeed, 

diW = 82(f) = d 2 d 4 (f) = dl{f) = 0, 

so 1(B) f = 0, and (E 1 - ft)/ = (E 2 - ft)/ = because 

xMf) = (ft - §ft>)/ 
x 2 d 2 (f) = 
xMf) = 
x 4 d 4 (f) = fft/. 

Erdelyi produced this solution and similar ones in other examples |Erd50] . but he 
furnished no explanation for why it should exist or how he found it. In this particular 
example, the Horn system has, in addition to the Puiseux monomial /, three linearly 
independent fully supported solutions, in the following sense. 

Definition 4.7. A Puiseux series solution / to a Horn system H(B, (3) is fully sup- 
ported if there is a normal affine semigroup Q of dimension m and a vector 7 6 C" 
such that the translate 7 + Q consists of vectors that are exponents on monomials 
with nonzero coefficient in /. 

The integer m = rank(L) in the definition is the maximum possible: the Euler 
operator equations impose homogeneity on Puiseux series solutions, meaning that 
every solution must be supported on a translate of L £g>^ C. In fact, the translate is 
by any vector 7 £ C n satisfying A'-f = (3. 

Questions 4.8. Consider the family of Horn systems determined by B with varying ft 

1. For which parameters /3 does H(B,/3) have finite-dimensional solution space? 

2. What is a combinatorial formula for the minimum solution space dimension, 
over all possible choices of the parameter (31 

3. Which (3 are generic in the sense that the minimum dimension is attained? 

4. Which monomials occur in solutions expanded as series centered at the origin? 

These questions arise from classical work done in the 1950s, such as Erdelyi's, and 
earlier. Implicit in Question [3] is that the dimension of the solution space rises above 
the minimum for only a "small" subset of parameters ft 

Example 4.9. In the H°° case from Example I4.5[ if ft = 0, then any (local holo- 
morphic) bivariate function f(xs,x 4 ) satisfying x^d^f + x 4 d 4 f = ft/ is a solution 
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of the Horn system H(B,f3). The space of such functions is infinite-dimensional; in 
fact, it has uncountable dimension, as it contains all Puiseux monomials Xg 3 ^ 4 with 
n>3, w 4 G C and w 3 + = f3 2 . When fa ^ 0, the solution space has finite dimension. 

The om example has vast numbers of linearly independent solutions expressible as 
Puiseux series with small support, but only for special values of (3. In contrast, in the 
0123 case there are many fewer series solutions of small support, but they appear for 
arbitrary values of 0. This dichotomy is central to the interactions of Horn systems 
with binomial primary decomposition. 

4.2. True degrees and quasidegrees of graded modules. The commutative alge- 
braic version of the dichotomy just mentioned arises from elementary (un)boundedness 
of Hilbert functions of A-graded modules (recall Definition I1.12[ Example I1.13[ and 
Lemma 12.151) ; see Definition 14.161 For the remainder of this section, fix a matrix 
A G Z dxn of rank d whose affine semigroup NA C Z d is pointed (Definition 15. 10|) . 

Lemma 4.10. A binomial ideal I C C[d] is A-graded if and only if it is generated by 
binomials x u — Ax v for which Au = A~v . □ 

It is of course not necessary — and it almost never happens — that every binomial of 
the form x u — Ax v with Au = A~v lies in I. 

Example 4.11. / = 1(B) is always A-graded when B is a matrix for ker(A). 

The set of degrees where a graded module is nonzero should, for the purposes of 
the applications to Horn systems, be considered geometrically. 

Definition 4.12. For any A-graded module M, 

tdeg(M) = {a G Z d | M a ^ 0} 

is the set of true degrees of M . The set of quasidegrees of M is 

qdeg(M) = tdeg(M), 

the Zariski closure in C d of the true degree set of M. 

The Zariski closure here warrants some discussion. By definition, the Zariski closure 
of a subset T C C d is the largest set T of points in C d such that every polynomial 
vanishing on T also vanishes on T. All of the sets T that we shall be interested in 
are sets of lattice points in Z d C C d . When T consists of lattice points on a line, for 
example, its Zariski closure T is the whole line precisely when T is infinite. When T 
is contained in a plane, its Zariski closure is the whole plane only if T is not contained 
in any algebraic curve in the plane. In the cases that interest us, T will always be a 
finite union of translates of linear subspaes of C rf . 

Lemma 4.13. If M is a finitely generated A-graded module over C[d], then qdeg(M) 
is a finite arrangement of affine subspaces of C d , each one parallel to TLAj for some 
J C {1, . . . , n}, where Aj is the submatrix of A comprising the columns indexed by J. 
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Proof. This is [DMMlO'l Lemma 2.5]. Since M is noetherian, it has a finite filtra- 
tion whose successive quotients are A- graded translates of C[<9]/p for various A-graded 
primes p. Finiteness of the filtration implies that qdeg(M) is the union of the quaside- 
gree sets of these A-graded translates. But tdeg(C[<9]/p) is the affine semigroup gen- 
erated by {deg((9j) | di £ p}, because tdeg(C[9]/p) is an integral domain. □ 

Example 4.14. If p = I L + mj is a binomial prime ideal, then qdeg(C[<9]/p) = CAj 
is the complex vector subspace of C d spanned by the columns of A indexed by J. 

Example 4.15. In the H°° case from Examples 11.31 and 11.171 consider the A-graded 
module M = C[d±, d 2 , d 3 , d^/I for varying ideals /. Then the true degree sets and 
quasidegree sets can be depicted as follows. 

1. When I = Ia = (did 3 —d 2 , d 3 — <9 4 ), so that M = C[<9i, d 3 ] by the homomorphism 
sending d 2 H- <9i<9 3 and <9 4 n- d 3 , 

A 



M = Cldl/i^da - d 2 , d 3 - <9 4 > ^ Cfa, d 3 ] „ 

deg(9i) = [J] and deg(<9 3 ) = [°] < — > %>[ 

tdeg(M)=N{[;],[;]}=NA {1 , 3} -- 

qdeg(M) = C 2 



2. When / = (d\, d 2 ), so that M = C[d 3 , <9 4 ] by the homomorphism sending d\ 1— > 
and d 2 \-> 0, 



M = C[d]/{d 1 ,d 2 )^C[d 3 ,d 4 ] 

deg(<9 3 ) = [J] and deg(<9 4 ) = [J] < — y d 3 ,aj 
tdeg(M) = N{[;]}=NA {3 , 4} 
qdeg(M) = vertical axis in C 2 
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3. When / = (8f, d 2 ), so that M £ C[8 3 , <9 4 ] © <9i<C[<9 3 , 8 4 ], 

m = c[d]/(dl d 2 ) = c[d 3 , d A ] © d 1 c[d 3 , ad 

deg(9i) = [J] and deg(«9 3 ) = [J] and deg(«9 4 ) = [J] 
tdeg(M) = NA {3 ,4 } U([J]+ NA {3 ,4}) 
qdeg(M) = vertical axis U ([ ] + vertical axis) in C 2 

Definition 4.16. An A-graded prime ideal p C C[d] is 

• tora/ if the Hilbert function u i— > dime (C[<9]/p) u is bounded for u £ Z d , and 

• Andean if the Hilbert function is unbounded. 

The adjective "Andean" indicates that Andean A-graded components sit like a high, 
thin mountain range on Z d , of unbounded elevation, over cosets of sublattices ZAj. 

Example 4.17. Consider the situation from Example 14.151 

1. The prime ideal I a in Example 14.151 1 is toral because the Hilbert function 
of C[8]/Ia only takes the value 1 on the true degrees. By definition, the Hilbert 
function vanishes outside of the true degree set. 

2. The prime ideal / = (81,82) in Example 14.151 2 is Andean because the Hilbert 
function of C[8]/I is unbounded: [J (->• k. 

Example 4.18. In the 0123 situation from Examples 11.21 and 11.161 the ideal I a = 
(8183—82, 8284—83, 8184—8283) is toral under the A-grading. Indeed, for any matrix A 
the toric ideal I a is toral under the A-grading by Lemma 12.151 the Q-graded Hilbert 
function of a monoid algebra k[Q] takes the constant value 1 and is hence bounded. 

Theorem 4.19. Fix an A-graded ideal I C <C[8]. Given that NA is pointed, every 
associated prime of I is A-graded, and I admits a decomposition as an intersection 
of A-graded primary ideals. The intersection /Andean of the primary components of I 
with Andean associated primes is well-defined. 

Proof. The A-graded conclusion on the associated primes is |MS05t Proposition 8.11]. 
The Andean part is well-defined because if p D q for some Andean p then q is also 
Andean. ("The set of Andean primes is closed under going down.") □ 

Corollary 4.20. If I C C[d] is an A-graded ideal, then I admits a decomposition 

I = ^toi-al H -^Andean 

into toral and Andean parts, where itorai is the intersection of the primary components 
of I with toral associated primes in any fixed primary decomposition of I. 



83,84. 
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Definition 4.21. The Andean arrangement of an ideal / is qdeg (C [d] //Andean )• 

The Andean arrangement is a union of affine subspaces of C d by Lemma 14.131 It is 
well-defined by Theorem 14.191 

Example 4.22. The lattice ideal 1(B) = (<9i<9 3 - d 2 , d ± d 4 - d 2 ) from Example |M1 
has primary decomposition 



with toral part J t orai = I a and Andean part /Andean = (^1,^2) by Example 14.171 
Therefore the Andean arrangement of 1(B) is the thick vertical line in Example 14.151 2. 

4.3. Counting series solutions. The distinction between toral and Andean primes 
provides the framework for the answers to Questions 14.81 Throughout the remainder 
of this section, fix a matrix B G Z™ xm of rank m = n — d such that AB = 0. 
Assume that B is mixed, meaning that every nonzero integer vector in the span of 
the columns of B has two nonzero entries of opposite sign. The mixed condition is a 
technical hypothesis arising while constructing series solutions to H(B,/3); its main 
algebraic consequence is that it forces NA to be pointed. 

The multiplicity of a prime ideal p in an ideal I is, by definition, the length of the 
largest submodule of finite length in the localization C[d]p/I f . This number is nonzero 
precisely when p is associated to /. Combinatorially, when p and / are binomial ideals, 
the multiplicity of p in / counts connected components of graphs related to Gi, such as 
those in Corollary 13.121 For the purposes of Horn systems, the most relevant number 
is derived from multiplicities of prime ideals as follows. 

Definition 4.23. The multiplicity fi(L, J) of a saturated sublattice L C Z J is the 
product Lfi, where l is the index \L/(LB D Z J )| of the sublattice 'LB D Z J in L, and 
fi is the multiplicity of the binomial prime ideal II + mj in the lattice ideal 1(B). 

The factor 1 = \L/(LB D Z J )| counts the number of partial characters p : L — > C* 
for which I Pt j is associated to 1(B). It is the penultimate combinatorial input required 
to count solutions to Horn systems. The final one is polyhedral. 

Definition 4.24. For any subset JC {1, . . . , n}, write vo1(t4j) for the volume of the 
convex hull of Aj and the origin, normalized so a lattice simplex in LAj has volume 1. 

Example 4.25. In the 0123 case, vo\(A) = 3, since the columns of A span Z 2 , and the 
convex hull of A with the origin is a triangle that is a union of three lattice triangles. 



1(B) 






vol (4) = 3 



vol(A {M} ) = 1 
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When J = {1,4}, in contrast, vol(y4j) = 1, since the first and last columns of A form 
a basis for the lattice (of index 3 in Z 2 ) that they span. 

Answers 4.26. The answers to Questions 14. 81 for the systems H(B,(3) are as follows. 

1. The dimension is finite exactly when (3 lies in the Andean arrangement of 1(B). 

2. The generic (minimum) dimension is fJ>(L, J) -vol(Aj), the sum being over all 
saturated L C Z J such that II + mj is a toral binomial prime with CAj = C d . 

3. The minimum rank is attained precisely when (3 lies outside of an affine subspace 
arrangement determined by certain local cohomology modules, with the same 
flavor as (and containing) the Andean arrangement. 

4. If the configuration A lies in an affine hyperplane not containing the origin, and 
(3 is general, then the solution space of H (B, (3) has a basis containing precisely 

j /i(L, J) ■ vol(Aj) Puiseux series supported on finitely many cosets of L. 

Example 4.27. To illustrate Answer [4.261 1 in the case, compare Example 14.91 to 
Example 14.221 : the solution space has finite dimension precisely when the parameter 
lies off the vertical axis, which is the Andean arrangement in this case. 

Example 4.28. In contrast, both associated primes are toral in the 0123 case, where 

i(B) = i A n(8 2 ,6 3 ). 

Indeed, the quotient of C[<9] modulo each of these components is an A-graded affine 
semigroup ring, with the second component yielding C[d]/ (82,83) = C[KL4{ 14 }]. It 
follows that the solution space has finite dimension for all parameters /3. 

On the other hand, Answer 14.261 2 is interesting in this 0123 case: Example 14.251 
implies that H(B,(3) has generic solution space of dimension 

3 + 1 = fi(ZB, {1, 2, 3, 4}) + //({0}, {1, 4}), 

with the first summand giving rise to solution series of full support, and the second 
summand giving rise to one solution series with finite support — that is, supported on 
finitely many cosets of {0} — by Answer [4.261 4. Compare Example 14.61 

Proof of Answers \4-26[ These are some of the main results of jDMM10 / ] . namely: 



1. Theorem 6.3. 

2. Theorem 6.10. 

3. Definition 6.9 and Theorem 6.10. 

4. Theorem 6.10, Theorem 7.14, and Corollary 7.25. 

The basic idea is to filter C[8]/I(B) with successive quotients that are A-graded trans- 
lates of C[8] /p for various binomial primes p . There is a functorial ( 'Euler-Koszul" ) 
way to lift this to a filtration of a corresponding D-module canonically constructed 
from H(B,f3). The successive quotients in this lifted filtration are A-hypergeometric 
systems of Gelfand, Graev, Kapranov, and Zelevinsky [GGZ871 IGKZ89j . The solution 
space dimension equals the volume for such hypergeometric systems, and the factor 
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n(L, J) simply counts how many times a given such hypergeometric system appears as 
a successive quotient in the D-module filtration. That, together with series solutions 
constructed by GGKZ, proves Answers 2 and 4. 

When a successive quotient is C[<9]/p for an Andean prime p, the Euler operators 
span a vector space of too small dimension; they consequently fail to cut down the solu- 
tion space to finite dimension: at least one "extra" Euler operator is needed. Without 
this extra Euler operator, its (missing) continuous parameter allows an uncountable 
family of solutions as in Example I4.9( this proves Answer 1. Answer 3 is really a 
corollary of the main results of |MMW05] , and is beyond the scope of this survey. □ 

To explain Erdelyi's observation (Example 14. 6p . note that only | ker A/ZB\ ■ vol(A) 
many of the series solutions from Answer 14.261 4 have full support, where ker(v4) = 
(ZJB) sa t is the saturation of the image of B. The remaining solutions have smaller 
support. Most lattice basis ideals have associated primes other than I^b [HSOOj . so 
most Horn systems H(B, /3) have spurious solutions, whether they be of the toral kind 
(finite-dimensional, but small support, perhaps for special paramaters /3) or Andean 
kind (uncountable dimensional). 

The "hyperplane not containing the origin" condition in Answer 14.261 4 amounts 
to a homogeneity condition on 1(B): the generators should be homogeneous under 
the standard N-grading of C[d], as in Example 14.281 More deeply, this condition is 
equivalent to regular holonomicity of the corresponding D-module [SW08] . As soon 
as the support of a Puiseux series solution is specified, hypergeometric recursions 
determine the coefficients up to a global scalar. 

The recursive rules governing the coefficients of series solutions to Horn hypergeo- 
metric systems force the combinatorics of lattice-point graphs upon binomial primary 
decomposition of lattice basis ideals, via the arguments in the proof of Answers 14.261 
Granted the generality of Sections [TH31 it subsequently follows that the questions as 
well as the answers work essentially as well for arbitrary A-graded binomial ideals, 
with little adjustment [DMM10'| . 

5. Combinatorial games 

Combinatorial games are two-player affairs in which the sides alternate moves, 
both with complete information and no element of chance. The germinal goal of 
Combinatorial Game Theory (CGT) is to find strategies for such games. After briefly 
reviewing the foundations and history of CGT using some key examples (Section 15. II) . 
this section provides an overview of how to phrase the theory in terms of lattice points 
in polyhedra (Section 15. 2p . Exploring data structures for strategies as generating 
functions (Section l5.3l) or in terms of misere quotients (Section l5.4l) leads to conjectures 
and computational open problems involving binomial ideals and related combinatorics. 
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5.1. Introduction to combinatorial game theory. There are many different ways 
to represent games and winning strategies by combinatorial structures. To understand 
their formal definitions, it is best to have in mind some concrete examples. 

Example 5.1. The quintessential combinatorial game is NlM. The players — you 
and I, say — are presented with a finite number of heaps of beans, such as 

•2 V" 

when there are three heaps, of sizes 3, 7, and 4. Any finite set of heaps is a position in 
the game of NlM. The game is played by alternating turns, where each turn consists 
of picking one of the heaps and removing at least one bean from it. For instance, if 
you play first, then you could remove one bean from the 3- heap, or three beans from 
the 4-heap, or all of the beans from the 7-heap, to get one of the following positions: 

•. or V.* • or 

The goal of the game is to play last. As it turns out, if I play first in the 3-7-4 
game, then you can always force a win by ensuring that you play last. How? Take 
the nim sum of the heap sizes: express each heap size in binary and add these binary 
numbers digit by digit, as elements of the field F 2 of cardinality 2: 

1 1 
111 
1 


Whatever move I make will alter only one of the summands and hence will leave a 
nonzero nim sum, at which point you can always remove beans from a heap to reset 
the nim sum to zero; I can't win because removing the last bean leaves a zero nim sum. 
This general solution to NlM is one of the oldest formal contributions to combinatorial 
game theory |Boul902] . More general "heap games" , in which players take beans from 
heaps according to specified rules, constitute a core class of examples for the theory. 

Example 5.2. Chess and its variants give rise to a rich bounty of combinatorial 
games. One of the most famous, other than Chess itself, is Dawson's Chess, 
played on a 3 x d board with an initial position of opposing pawns facing one another 
with a blank rank (row) in between |Daw34j . When d = 7, here is the initial position: 
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Moves in Dawson's Chess are the usual moves of Chess pawns. The only additional 
rule is that a capture must be made if one is possible. For example, if white moves first 
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(as usual) and chooses to push the third pawn, then the game could begin as follows: 
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And now it is black's turn; with no captures available, black is allowed to push any 
pawn in file (column) 1, 5, 6, or 7 down to the middle rank. Another bloodbath 
ensues, and then it is white's turn to push a pawn freely. 

The goal of Dawson's Chess is to force your opponent to play last. Thus, in 
contrast to NlM, a player has won when their turn arrives and no moves are available. 

Definition 5.3. An impartial combinatorial game is a rooted directed graph on which 
two players alternate moves along edges. A player wins by moving to a node (position) 
with no outgoing edges. A game is finite if its graph is finite and has no directed cycles. 

The root of the directed graph corresponds to the initial position. Any finite impar- 
tial game can equivalently be represented as a rooted tree in which the children of each 
position correspond to its options: the endpoints of its outgoing edges in any directed 
graph representation. A given position might be repeated in the tree representation. 
Given a tree representation, an optimally efficient directed graph representation can 
be constructed by identifying all vertices whose descendant subtrees are isomorphic. 

Example 5.4. In the language of combinatorial game theory, NlM is not a finite 
game but a family of finite games that specifies a consistent "rule set" for how to play 
starting from any particular initial position among an infinite number of possibilities. 
In the situation of Example 15.11 using ijk for < i, j, k < 9 to represent three heaps 
of sizes i, j, and k, the top of the game tree is 

374 




074 174 274 304 314 • • ■ 372 373 




004 014 • • • 073 004 • • • 302 303 073 • • • 371 372 



Note that positions such as 372, 004, and 073 are repeated in the tree (even in the 
small bit of the tree depicted here), at the same level or at multiple levels; their 
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descendant subtrees can be identified to form another directed graph representation 
of this game of NlM. Every leaf of the tree corresponds to the position 000. 

Remark 5.5. It is natural to wonder why the games in Definition 15.31 are called 
"impartial". The term is meant to indicate that the players have the same options 
available from each position, as opposed to partizan games, where the players can have 
distinct sets of options. "But only one player is allowed to play from each position," 
you may argue, "so how can you tell the difference between impartial and partizan 
games?" The answer is to put two games side by side; this is called the disjunctive 
sum of the two games: the player whose turn it is chooses one game and plays any 
of their legal moves in that game. The result is that a player can end up making 
consecutive moves in a single game, if the intervening move took place in the other 
game. In a partizan game, such as ordinary Chess, white's options from any given 
position are different from black's options. When white makes two consecutive moves 
on the same board, they are two consecutive moves of white pieces only. In contrast, 
in an impartial game such as NlM, a move by either player could have been made by 
the other player, if the other player had the chance. 

Beyond the ability to distinguish between impartial and partizan games, what are 
disjunctive sums good for? The answer is that these sums arise naturally when posi- 
tions decompose, in the course of play, into smaller independent subgames. 

Example 5.6. The final position in Example 15.21 can be represented as a disjoint 
union of two Dawson's Chess boards, one with one file and one with three, 
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except that now it is black's turn to move. Similarly, an initial move at either end of 
the board obliterates the two columns at that end, leaving the other player to move. 
In addition, any move on a 3 x 1 or 3 x 2 board obliterates the entire board, as does 
a move on the middle file of a 3 x 3 board. 

This description implies that Dawson's Chess is a heap game, by restricting to 
the ordinary (non-capturing) pawn moves. Indeed, any connected board in its initial 
position is a heap whose size is its number of files (that is, its width), and any position 
between bloodbaths is a disjunctive sum of such boards. The rules allow any player to 

• eliminate any heap of size 1, 2, or 3; 

• take two or three from any heap of size at least 3; or 

• split any heap of size d > 3 into two heaps of sizes k and d — 3 — k. 

Dawson's choice for the ending of his fairy chess game was both unfortunate and 
fortuitous. It was unfortunate because it made the game hard: over three quarters of 
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a century after Dawson published his little game, its solution remains elusive, both 
computationally and in a closed form akin to Bouton's NlM solution. 

Open Problem 5.7. Determine a winning strategy for Dawson's Chess and find 
a polynomial-time algorithm to calculate it. 

Dawson's choice was fortuitous because the simple change of ending uncovered a 
remarkable phenomenon: the vast difference between trying to lose and trying to win. 

Definition 5.8. Given a finite combinatorial game, the misere play version declares 
the winner to be the player who does not move last. 

Thus misere play is what happens when both players try to lose under the normal play 
rules. It fosters amusing titles such as "Advances in losing" [Pla09j . Dawson's Chess 
motivated substantial portions of the development of CGT over the past few decades. 

Misere games are generally much more complex than their normal-play counter- 
parts. Heuristically, the reason is that, in contrast to the unique "zero position" in 
normal play, the multiple "penultimate positions" that become winning positions in 
misere play cause ramifications in positions expanding farther from the zero position, 
and these ramifications interfere with one another in relatively unpredictable ways. 

Regardless of the reason, aspects of the fact of misere difficulty were formalized 
by Conway in the 1970s (see |Con01j ). There are, for example, many more non- 
isomorphic impartial misere games than impartial normal play games of any given 
birthday (the height of the game tree). For comparison, note that a complete structure 
theory for normal play games was formulated in the late 1930s |Spr36 , IGru39j . It 



is based on the Sprague-Grundy theorem, building on Bouton's solution of NlM by 
reducing all finite impartial games to it: every impartial game under normal play is, in 
a precise sense, equivalent to a single NlM heap of some size. (The details of this theory 
would be more appropriate for a focused exposition on the foundations of CGT, such 
as Siegel's highly recommended lecture notes [Sie06j . which proceed quickly to the 
substantive aspects from an algebraic perspective. Additional background and details 
can be found in |ANW07l IBCG82] .) Because the "zero position" is declared off-limits 
in misere structure theory, the elegant additivity of normal play under disjunctive 
sum fails for misere play, and what results is algebraically complicated in that case; 
see Section [531 

5.2. Lattice games. Impartial combinatorial games admit a reformulation in terms 
of lattice points in polyhedra. For the purpose of Open Problem 15.71 the idea is to 
bring to bear the substantial algorithmic theory of rational polyhedra |BW03j . The 
transformation begins by a simple change of perspective on NlM, Dawson's Chess, 
and other heap games. 

Example 5.9. For certain families of games, the game tree is an inefficient encoding. 
For heap games, it is better to arrange the numbers of heaps of each size into a 
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nonnegative integer vector whose i entry is the number of heaps of size i. Thus the 
374 and 373 positions from Example 15.41 become 

3 7 4 H (0,0,1,1,0,0,1) £ N 7 
3 7 3 H (0,0,2,0,0,0,1) £ N 7 . 

The moves "make a heap of size j into a heap of size i < f and "remove a heap of 
size j" correspond to other (not necessarily positive) integer vectors, namely 

(. . . , 1, . . . , —1, . . .) = ej — ej for i < j, and 
(. . . , -1, . . .) = -ej for all j > 1, 

where e±, . . . , is the standard basis of Z d . All entries in the moves are zero except 
for the 1 and —1 entries indicated. Nim looks curiously like it could be connected to 
root systems of type A, but nothing has been made of this connection. 

Example 15.91 says that Nim positions with heaps of size at most d are points in N d , 
and moves between them are vectors in Z d . The idea behind lattice games is to 
polyhedrally formalize the relationship between the positions and moves. To that 
end, for the rest of this section, fix a pointed rational cone C C Z d of dimension d, 
and write Q = C D Z d for the normal affine semigroup of integer points in C. As in 
earlier sections, basic knowledge of polyhedra is assumed; see |Zie95] for additional 
background. For simplicity, this survey restricts attention to lattice games that are 
played on (the lattice points in) cones instead of arbitrary polyhedra, and to special 
rule sets for which every position has a path to the origin (cf. |GM10t Lemma 3.5]); 
see jGMlOl §2] for full generality. 

Briefly, a lattice game is played by moving a token on a game board comprising all 
but finitely many of the lattice points in a polyhedral cone. The allowed moves come 
from a finite rule set consisting of vectors that generate a pointed cone containing the 
game board cone. The game ends when no legal moves are available; the winner is the 
last player to move. The misere condition is encoded by the finitely many disallowed 
lattice point positions. To define these properly, it is necessary to define rule sets first. 

Definition 5.10. A rule set is a finite subset rcZ d \ {0} such that 

1. the affine semigroup RT is pointed, meaning that its unit group is trivial, and 

2. every lattice point p £ Q has a T-path to in Q, meaning a sequence 

= po, . . . , p r = p in Q, with p i+1 - pi £ T, 
as illustrated in the following figure. 
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P 






With these conventions, moves correspond to elements of — r rather than of T itself. 
The sign is a choice that must be made, and neither option is fully convenient. The 
choice in Definition 15.101 prevents unpleasant signs in the next lemma. 

Lemma 5.11. Nr contains Q and induces a partial order on Z rf in which p H q 
whenever q — p £ Nr. 

Proof. The containment is immediate from Definition 15.101 The partial order occurs 
because Nr is a pointed affine semigroup. □ 

Definition 5.12. A lattice game played on a normal affine semigroup Q = CflZ d has 

• a rule set T, 

• defeated positions D C Q that constitute a finite T-order ideal, and 

• game board B — Q \ D. 

Lattice points in Q are referred to as positions; note that these might lie off the 
game board. A position p £ Q has a move to q if p — q £ T; the move is legal if q £ B. 
The order ideal condition, which means by definition that q £ D ^ p £ D if p d Q, 
guarantees that a legal move must originate from a position on the game board B. 

Example 5.13. A heap game in which the heaps have size at most d is played on 
Q = N d . Under normal play, the game board is all of N d , so D = 0. To get misere 
play, let the set of defeated positions be D = {0}, so B = N d \ {0}. Larger sets of 
defeated positions allow generalizations of misere play not previously considered. 

Example 5.14. Dawson's Chess is a lattice game on N d when the heap sizes are 
bounded by d. The game board in this case is N d \ {0}, corresponding to misere play. 
The rule set is composed of the following vectors, by Example 15.61 

• ei; 

• e 2 and ej — ej_ 2 for j > 3; 

• e 3 and ej — ej__ 3 for j > 4 and ej — — ej_ 3 _j for j > 5. 

Historically, the abstract theory of combinatorial games was developed more using 
set theory than combinatorics. Formally, a finite impartial combinatorial game is often 
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defined as a set consisting of its options, each being, recursively, a finite impartial 
combinatorial game. Using this language, the disjunctive sum of two games G and H 
is the game G + H whose options comprise the union of {G' + H | G' is an option of G} 
and {G+H' | H' is an option of H}. A set of games is closed if it is closed under taking 
options and under disjunctive sum. In particular, the closure of a single game G is 
the free commutative monoid on G and its followers, meaning the games obtained 
recursively as an option, or an option of an option, etc. See |PS07| and its references 
for more details on closure and on the historical development of CGT. 

Theorem 5.15. Any position in a lattice game determines a finite impartial combi- 
natorial game. Conversely, the closure of an arbitrary finite impartial combinatorial 
game, in normal or misere play, can be encoded as a lattice game played on N d . 

Proof. The first sentence is a consequence of Lemma 15.111 The second is |GM10t 
Theorem 5.1]: if the game graph has d nodes, then lattice points in Q = N d correspond 
to disjunctive sums of node-positions. □ 

The proof of Theorem 15.151 clarifies an important point about the connection be- 
tween lattice games and games given by graphs: lattice game encodings are efficient 
only when the nodes of the graph represent "truly different" positions. 

Example 5.16. The encoding of the 374 Nim game in Example 15.41 by using all d of 
the followers of 374 as coordinate directions in N d is woefully inefficient. On the other 
hand, the 374 position is encoded efficiently in N 7 because it lies in the closure of the 
single Nim heap of size 7, whose followers are "truly different" from one another. 

This explains part of the reason for allowing arbitrary normal affine semigroups as 
game boards: more classes of combinatorial games beyond heap games can be encoded 
efficiently. That said, heap games are now — and have been for decades — key sources 
of motivation and examples. As such, the encoding of heap games is particularly 
efficient for the following class of games |GM10l §6— §7] . 

Definition 5.17. A lattice game is squarefree if it is played on and the maximum 
entry of any vector in the rule set is 1. Equivalently, a squarefree game represents a 
heap game in which each move destroys at most one heap of each size. 

Multiple heaps of different sizes can be destroyed, and a destroyed heap can be 
replaced with multiple heaps of other sizes, as long as the moves still form a rule set. 

Example 5.18. Squarefree games are the natural limiting generalizations of octal 
games, invented by Guy and Smith [GS56] with Dawson's Chess as a motivating 
example. For each k £ N, an octal game specifies whether or not any heap of size 

• k may be destroyed; 

• j > k + 1 may be turned into a heap of j — k; and 

• j > k + 2 may be turned into two heaps of sizes summing to j — k. 
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These constitute three binary choices, and hence are conveniently represented by an 
octal digit < d k < 7. Dawson's Chess is ".137" as an octal game, where the digits 
correspond, in order, to the types of moves in Example 15.141 For example, 7 = 111 in 
binary indicates that all three options are allowed for k = 3, while 3 = Oil indicates 
that only the top two are available for k = 2. The dot in ".137" is just a place-holder. 

5.3. Rational strategies. What does a strategy for a combinatorial game look like? 
Abstractly, the fmiteness condition ensures that one of the players can force a win. 
The argument explaining why is recursive and elementary. But how does one describe 
such a strategy? Lattice games provide malleable data structures for this purpose. 

Definition 5.19. Two subsets W,L<ZB are winning and losing positions for a lattice 
game with game board B if 

• B = W U L is the disjoint union of W and L; and 

• (w + r) nB = l. 

Winning positions are the desired spots to move to. The first condition says that 
every position in B is either a winning position (the player who moved to that spot 
can force a win) or a losing position (the player who moves from that spot can force a 
win by moving to a winning position). The second condition says that losing positions 
are precisely those with (legal) moves to winning positions. 

Example 5.20. Consider the game NlM 2 of NlM with heaps of size at most 2. The 
rule set in this case is {[ ]> [J) The negatives of these vectors — representing 

the legal moves from a generic position — are depicted in the following figure, along 
with the winning positions in N1M2 for both for normal and misere play. An easy way 
to verify that the depicted sets W are forced is to figure out what happens on the 
bottom row first, from left to right, and then proceed upward, row by row. 



normal play 
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D 



misere play 



The defeated position, labeled by D in the misere play diagram, causes the bottom 
row of winning positions to be shifted over one unit to the right. 



Remark 5.21. The disarray caused by the defeated position in Example 15. 20l becomes 
substantially worse with more complicated rule sets in higher dimensions. Much of the 
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study of misere combinatorial games amounts to analyzing, quantifying, computing, 
and controlling the disarray. 

Theorem 5.22. Given a lattice game with rule set T C 1* d and game board B, there 
exist unique sets W and L of winning and losing positions for B. 

Proof. This is |GM10l Theorem 4.6]. The main point is that the cones generated by 
Q and Nr point in the same direction, so recursion is possible after declaring the 
r-minimal positions in B to be winning. □ 

Thus everything there is to know about a lattice game is encoded by its set of 
winning positions: given a pointed normal affine semigroup, Theorem 15.221 implies 
that specifying a rule set and defeated positions is the same as specifying a valid set 
of winning positions, at least abstractly. But the rule set encodes winning strategies 
only implicitly, while the set of winning positions — or better, a generating function 
jfw(t) = Y2wew^ W f° r ^ ne winning positions — encodes the strategy explicitly. The 
following is |GM10[ Conjecture 8.5]. 

Conjecture 5.23. Every lattice game has a rational strategy: a generating function 
for its winning positions expressed as a ratio of polynomials with integer coefficients. 

Example 5.24. Resume Example 15.201 In normal play N1M2, a rational strategy is 

1 



fw(a,b) 



;i-a 2 )(l-6 2 )' 

the rational generating function for the affine semigroup 2N 2 . In misere play, a rational 
strategy is 

a u2 
fw{a,b) = - + 



I — a? (1 -a 2 )(l -b 2 )' 
where the first term enumerates the odd lattice points on the horizontal axis, and the 
second enumerates the normal play winning positions that lie off the horizontal axis. 

Example 5.25. For a squarefree game, if W = W D {0, l} d C N d then 

W = W + 2W l . 

This is [GMlOt Theorem 6.11]. It implies that the game has a rational strategy 

The reader is encouraged to reconcile this statement with Example 15.11 

A rational strategy has a reasonable claim to the title of "solution to a lattice game" 
because it has the potential to be compact, and it can be manipulated algorithmically. 

Theorem 5.26. A rational strategy for a lattice game produces algorithms to 



46 



EZRA MILLER 



• determine whether a position is winning or losing, and 

• compute a legal move to a winning position, given any losing position. 

These algorithms are efficient when the rational strategy is a short rational function, 
in the sense of Barvinok and Woods |BW03] . 

Proof. This is a straightforward application of the theory developed by Barvinok and 
Woods; see [GMlO'j for details. □ 

The efficiency in the theorem is in the sense of complexity theory. Short rational 
generating functions have not too many terms in their numerator and denominator 
polynomials. They are algorithmically efficient to manipulate and — when they enu- 
merate lattice points in polyhedra — to compute. Since computations of lattice points 
in rational polyhedra are efficient, it would be better to get a polyhedral decomposi- 
tion of the set W of winning positions. In fact, examples of lattice games exhibit a 
finer structural phenomenon than is indicated |GM10t Conjecture 8.9] & |GMW09~] . 

Conjecture 5.27. Every lattice game has an affine stratification: an expression of 
its winning positions as a finite union of translates of affine semigroups. 

Roughly speaking, winning positions should be finite unions of sets of the form (lat- 
tice n cone). This definition of affine stratification differs from |GM10t Definition 8.6] 
but is equivalent |MillOt Theorem 2.6]; it would also be equivalent to require the union 
to be disjoint, or (independently of disjointness) the affine semigroups to be normal. 

Example 5.28. Consider again the situation from Examples 15.201 and 15.241 An affine 
stratification for this game is W = 2N 2 ; that is, the entire set of winning positions 
forms an affine semigroup. In misere play, W = ((1, 0) + N(2, 0)) U ((0, 2) + 2N 2 ) is 
the disjoint union of W\ = 1 + 2N (along the first axis) and W2, which equals the 
translate by twice the second basis vector of the affine semigroup 2N 2 . 

Remark 5.29. Conjecture 15.271 bears a resemblance to statements about local coho- 
mology of finitely generated ZQ-graded modules M over an affine semigroup ring k[Q] 
with support in a monomial ideal: the local cohomology H\(M) is supported on a 
finite union of translates of affine semigroups |HM05j . If Conjecture 15.271 is true, then 
perhaps it would be possible to develop a homological theory for winning positions in 
combinatorial games that explains why. 

Theorem 5.30. A rational strategy can be efficiently computed from any given affine 
stratification. 



Proof. See [GMIO'I §5] . As with Theorem 15. 26} this is reasonably straightforward, 



applying the methods from |BW03] with care. □ 

Algorithms for dealing with affine stratifications and rational strategies are step- 
ping stones toward a higher aim, which would be to prove the existence of affine 
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stratifications (Conjecture 15.27ft . and hence rational strategies (Conjecture 15.23ft . in 
an efficient algorithmic manner and in enough generality for Dawson's Chess (Open 
Problem 15.7ft . Part of the problem to overcome for Dawson's Chess is the need to 
deal with increasing heap size d. That problem is key, since the algorithm for Daw- 
son's Chess is supposed to be polynomial in d as d — > oo, but affine stratifications 
and rational strategies at present are designed for fixed d. 

5.4. Misere quotients. The development of lattice games and rational strategies 
outlined in the previous subsections were motivated by — and continue to take cues 
from — exciting recent advances in misere theory by Plambeck and Siegel |Pla05j IPS07] 
pertaining to misere quotients (Definition l5.31ft . The lattice point methods are also be- 
ginning to return the favor, spawning new effective methods for misere quotients. This 
final subsection on combinatorial games ties together the lattice point perspectives on 
games and binomial ideals with misere quotients, particularly in Theorem 15.361 

Definition 5.31. Fix a lattice game with winning positions W C Q in a pointed 
normal affine semigroup. Two positions p,q G Q are indistinguishable, written p ~ q, if 

{p + Q)DW -p = (q + Q)nW -q. 

In other words, p + rEW^q + rEW for all r E Q. The misere quotient of G is 
the quotient Q = Q/~ of the affine semigroup Q modulo indistinguishability. 

Geometrically, indistinguishability means that the winning positions in the cone 
above p are the same as those above q, up to translation by p — q. 



p + r vs. q + r 








Lemma 5.32. Indistinguishability is a congruence in the sense of Definition \2.1% so 
the misere quotient Q is a monoid. □ 

Example 5.33. In the situation of Example 15.251 the misere quotient is a quotient 
of (Z/2Z) d = N d /2N d jGMlOl Proposition 6.8]. As with Example EM the reader is 
encouraged to reconcile this statement with Example 15.11 
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Example 5.34. For N1M2 (Examples 15.201 l5.24[ and 15.281) . the misere quotient is the 
commutative monoid with presentation Q = (a, b | a 2 = 1, b 3 = b), in multiplicative 
notation. This monoid has six elements because it is (a | a 2 = 1) x (b | b 3 = b), and 
the second factor has order 3. The presentation of Q can be seen geometrically in 
the right-hand figure from Example 15 . 201 : translating the grid two units to the right 
moves W bijectively to the part of W outside of the leftmost two columns (this is 
a 2 = 1), and translating the grid up by two units takes the part of W above the first 
row bijectively to the part of W above the third row (this is b 3 = b). 

Misere quotients were introduced by Plambeck [Pla05] as a less stringent way to 
collapse the set of games than had been proposed earlier by Grundy and Smith |GS56j . 
in view of Conway's proof that very little simplification results when the collapsing 
is attempted in too large a universe of games |Con01[ Theorem 77]. Misere quotients 
have subsequently been studied and applied to computations by Plambeck and Siegel 
|PS07l ISie07j . the point being that taking quotients often leaves a much smaller— 
and sometimes finite — set of positions to consider, when it comes to strategies. The 
Introduction of [PS07j contains an excellent account of the history, including personal 
accounts from some of the main players. 

The first contribution of lattice games to misere theory is the following. 

Proposition 5.35. A short rational strategy for a game played on Q results in an 
efficient algorithm for determining the indistinguishability of any pair of positions 
in Q. In particular, an affine stratification results in such an algorithm. 

Proof. This is the main result in [GMlO'l §6] . If / = J2 q eQ an d g = J2 q eQ 
are two short rational functions, then their Hadamard product f * g = ^2 q& Q i Pq4 ) q^ q 
can be efficiently computed as a short rational function [BW03j . □ 

The final result in this section combines three of the main themes thus far in the 
survey: lattice games, misere quotients, and — for the proof — binomial combinatorics. 

Theorem 5.36. Lattice games with finite misere quotients have affine stratifications. 

Proof. This is |Mill0l Corollary 4.5], given that we are working with games played on 
normal affine semigroups. The proof proceeds via a general result [MillOl Theorem 3.1] 
of interest here: the fibers of any projection Q — >• Q from an affine semigroup Q to a 
monoid Q all possess affine stratifications. This is proved using combinatorial meso- 
primary decompositions of congruences — the (simpler) monoid analogues of binomial 
mesoprimary decomposition — whose combinatorics, as in Example I3.24[ gives rise to 
affine stratifications of fibers. When the misere quotient of a lattice game is finite, 
the winning positions automatically comprise a finite union of fibers. □ 

Open Problem 5.37. Find an algorithm to compute the misere quotient of any 
lattice game starting from an affine stratification. 
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Algorithms for computing finite misere quotients are known and useful |PS07l Ap- 
pendix A]. In addition, Weimerskirch has algorithmic methods that apply in the 
presence of certain known periodicities |Wei08j . although for infinite quotients the 
methods fail to terminate. 

Binomial primary decomposition, or at least the combinatorial aspects present in 
mesoprimary decomposition of congruences on monoids, is likely to play a role in 
further open questions on misere quotients, including when finite quotients occur, 
and more complex "algebraic periodicity" questions, which have yet to be formulated 
precisely |PS07t Appendix A. 5]. 

6. Mass-action kinetics in chemistry 

Toss some chemicals into a vat. Stir. What products are produced? How fast? If 
the process is repeated, can the result differ? These questions belong to the study of 
chemical reaction dynamics. One of the earliest theories of such dynamics, the law of 
mass action, was formulated by Guldberg and Waage in 1864 |GW1864j . It is widely 
observed to hold in real-life chemical systems (as distinguished from, say, biochemical 
systems; see Remark 16.91) . 

Over the years, mass-action kinetics has matured, especially certain mathematical 
aspects following seminal work by Horn, Jackson, and Feinberg [HJ721 IFei87] from 
the 1970s and onward. In the past decade, the resulting mathematical formalizations 
have seen increasing amounts of algebra, particularly of the binomial sort. This section 
provides a brief overview of mass-action kinetics (Section 16. ip . covering just enough 
basics to understand the relevance of binomial algebra. From there, the main goal is 
to explain the Global Attractor Conjecture (Section 16. 2p . which posits that a system 
of reversible chemical reactions always reaches the same steady state if one exists, 
with a view to how binomial primary decomposition could be relevant to its solution. 

Length constraints prevent many substantial details, as well as examples demon- 
strating key phenomena, from being included. For an elementary introduction to 
chemical reaction network theory, in mathematical language, the reader is referred to 
the well- written notes by Gunawardena |Gun03j . For details on an abstract formal- 
ization of the law of mass-action in terms of binomials, see |AGHMR09j . 

6.1. Binomials from chemical reactions. Before presenting mass-action kinetics 
in general, it is worthwhile to study a small sample reaction. 

Example 6.1. Consider the breakdown of hydrogen peroxide into water and oxygen: 

2H 2 2 ^ 2H 2 + 2 

The A and \x here are rate constants: A indicates that two molecules of peroxide 
decompose into two molecules of water and one molecule of oxygen at some rate, and \i 
indicates that the reverse reaction also occurs, though at another (in this case, slower) 
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rate: two molecules of water and one molecule of oxygen react to from two molecules of 
peroxide. To be precise, let x = [H 2 2 ], y = [H 2 0], and z = [0 2 ] be the concentrations 
of peroxide, water, and oxygen in some medium. These concentrations are viewed as 
functions of time, and as such, they satisfy a system of ordinary differential equations: 

x = 2{xy 2 z — 2\x 2 

y = 2Xx 2 - 2fiy 2 z 

z = Xx 2 — fiy 2 z. 

The right-hand side of x says that after an infinitesimal unit of time, 

• for every two molecules of H 2 2 that came together (this is the x 2 term), two 
molecules of H 2 2 disappear (this is the —2 in the coefficient of x 2 ) some fraction 
of the time (this is the meaning of A in the coefficient of x 2 ); and 

• for every two H 2 molecules and one 2 that came together (the y 2 z term), 
two molecules of H 2 2 are formed (the 2 on y 2 z) some fraction of the time (fi). 

The fact that x and y are squared in all of the right-hand sides is for the same 
reason that y 2 is multiplied by z: the products represent concentrations of chemical 
complexes. Thus y 2 z can be thought of as an "effective concentration" of 2H 2 + 2 . 

Example 6.2. For comparison, it is instructive to see what happens when a new 
species is introduced to the chemical equation. Suppose that the reaction 2H 2 2 ^ 
2H 2 + 2 in Example 16. II had a fictional additional term on the right-hand side: 

2H 2 2 ^ 2H 2 + 2 + 3A 

A* 

The right-hand sides of the equations governing the evolution of the species H 2 2 , 
H 2 0, and 2 would remain binomial: 

x = 2/j,y 2 za 3 - 2Xx 2 

y = 2Xx 2 - 2[iy 2 za 3 

z = Xx 2 — jjy 2 za 3 

as would the new equation a = 3Xx 2 —3fiy 2 za 3 governing the evolution of the species A. 

In general, a chemical reaction involves species si, . . . ,s n with corresponding con- 
centrations [si] = Xi, each viewed as a function Xi = Xi(t) of time. In Example 16.11 the 
species are peroxide, water, and oxygen. A reaction A ^ B occurs between chemical 
complexes A = a±Si + ■ ■ ■ + a n s n and B = b\Si + • • ■ + b n s n . Thus the complex A is 
composed of <2j molecules of Sj for i = 1, . . . , n, and similarly for B. In Example 16. 1\ 
the complexes are H 2 2 and 2H 2 + 2 . 

Definition 6.3. A reaction A ^ B between complexes A = axSi + • • • + a n s n and 

B = biSi + • • • + b n s n evolves under mass action kinetics |GW1864] if species Sj is 
lost at di times a rate proportional to the concentration x" 1 • • • x^ n of A, and gained 
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at bi times a rate proportional to the concentration „ n of B. The reaction is 

reversible if the reaction rates in both directions are strictly positive. 

As in Example 16. 1\ the concentration of a complex A is the product of the species 
concentrations with exponents corresponding to the multiplicities of the species in A. 

Proposition 6.4. The differential equation governing the evolution of the reversible 
reaction A =f± B from Definition \6.3\ under mass-action kinetics is 

Xi = (k - a;)(Ax a - //x b ), 

with rate constants A, /x > 0. In vector form, with A = A a b and /x = A ba> this becomes 

x= (b-a)(A ab x a - A ba x b ) 

Proof. This is merely a translation of Definition 16.31 into symbols. □ 

The factor of Ax a — /xx b in Proposition 16.41 is a scalar quantity; the only vector 
quantity on the right-hand side is b — a. 

A single reaction under mass-action kinetics reaches a steady state when the bino- 
mial on the right-hand side of its evolution equation vanishes. Thus the set of steady 
states for a single reaction is the zero set of a binomial. General reaction systems 
involve more than one reaction at a time: in a given vat of chemicals, simultaneous 
transformations take place involving different pairs of chemical complexes using the 
given set of species in the vat. 

Definition 6.5. For multiple reactions on a set of species, in which each reaction 
A ^ B involves complexes with species vectors a = (a l5 . . . , a n ) and b = (bi, . . . , b n ), 
the law of mass- action is obeyed if the species evolve according to the binomial sum: 

x= ^(b-a)(A ab x a -A ba x b ). 

Thus, when more than one reaction is involved, the 2 th entry of the vector field on the 
right-hand side is not a binomial but a sum of binomials, one for each reaction in which 
species Si occurs. Consequently, the set of steady states need not be binomial [DMIOj . 

Definition 6.6. A point £ = . . . , £ n ) 6 1" is a detailed balanced equilibrium for 
a reversible reaction system if it is strictly positive, meaning that > for all i, 
and every binomial summand on the right-hand side in Definition 16.51 vanishes at £. 
A system is detailed balanced if it is reversible and has a detailed balanced equilibrium. 

Definition 16.61 creates the bridge from chemistry to binomial algebra: the chemical 
interest lies in equilibria, and these are varieties of binomial ideals. Detailed balanced 
equilibria lie interior to the positive orthant in IR n . For such concentrations of the 
species, each reaction A ^ B in the system rests at equilibrium with both A and B 
present at some nonzero concentration. In fact, more is true. 
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Theorem 6.7. A detailed balanced equilibrium is a locally attracting steady state. 

Proof. The main point is that a detailed balanced equilibrium possesses an explicit 
strict Lyapunov function (given by Helmholtz free energy) |HJ72j. lFei87| . □ 

Remark 6.8. Chemical reaction network theory (CRNT) works in more general set- 
tings than systems of reactions each of which is reversible; see |Gun03] for an introduc- 
tion. The theory is most successful when the system of reactions is weakly reversible: 
each individual reaction A — > B need not be reversible, but it must be possible to 
reach the reactant complex A from the product complex B through a sequence of 
reactions in the system. See also Remark 16.111 

Remark 6.9. Mass-action kinetics fails for more complicated chemical systems, such 
as biochemical ones. Indeed, it must fail, for life abhors chemical equilibrium: an 
organism whose chemical reactions are at steady-state is otherwise known as dead. 
The failure of mass-action kinetics in biochemical systems occurs for a number of 
reasons. For one, the reaction medium is not homogeneous — that is, the reactants 
are not well-mixed). In addition, the molecules are often too big, and the number of 
them too small, for the natural discreteness to be smoothed; see [Gun03[ §2]. 

6.2. Global attractor conjecture. Polynomial dynamical systems — linear ordinary 
differential equations with polynomial right-hand sides — behave quite poorly and un- 
predictably, in general. The famous chaotic Lorenz attractor, for example, is defined 
by a vector field whose entries are simple 3-term cubics. However, the binomial nature 
of mass-action chemistry lends a striking tameness to the dynamics. 

The reversibility hypothesis for detailed balanced systems is natural from the per- 
spective of chemistry: every reaction can, in principle, be reversed (although the 
activation energy required might be prohibitive under standard conditions). Over- 
whelming experience says that typical chemical reactions — well-mixed, at constant 
temperature, as in chemical manufacturing — approach balanced steady states, and 
the same products really do emerge every time. But this is surprisingly unknown 
theoretically for detailed balanced systems under mass-action kinetics, even though 
their equilibria are local attractors by Theorem 16.71 

Conjecture 6.10 (Global Attractor Conjecture [HJ721 IHor74] ) . // a reversible re- 
action system as in Definition \6.5\ has a detailed balanced equilibrium, then every 
trajectory starting from strictly positive initial concentrations reaches it in the limit. 

The Global Attractor Conjecture is "the fundamental open question in the field" 
|AGHMR09[ §1], since it would close the book on fundamentally justifying mass- 
action kinetics. It is known that the conjecture holds when the binomial ideal is 
prime |Gop09| . It is also known, for detailed balanced reaction systems with fixed 
positive initial species concentrations, that 

1. the detailed balanced equilibrium is unique |Fei87] . and 
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2. each trajectory tends toward some equilibrium jSonOlt l"Cha03] . 

Thus it suffices to bound every strictly positive trajectory away from all boundary 
equilibria, for then the trajectory limits are forced toward the detailed balanced one. 

Remark 6.11. Detailed balancing is a stronger hypothesis than required for the 
known results listed above, including Theorem 16.71 Weak reversibility as in Re- 
mark 16.81 or even a less stringent condition, often suffices. Detailed balancing is 
also weaker than the hypothesis in the strongest (and still widely believed) form of 
Conjecture 16.101 which stipulates a condition called complex-balancing that implies 
weak reversibility; see, for instance, |AS09l §4.2] for a precise statement. 

What do the boundary equilibria look like? The restriction of a detailed balanced 
reaction system to a coordinate subspace of M. n amounts to forcing the concentrations 
of some reactant species to be zero. Such restrictions still constitute detailed balanced 
reaction systems, and the binomials whose vanishing describes the equilibria come 
from the binomials in the original system. This discussion can be rephrased as follows. 

Proposition 6.12. Boundary equilibria of detailed balanced reaction systems are zeros 
of associated primes of the ideal generated by the binomials in Definition \6.5l Con- 
jecture \6.1C\ holds if and only if every trajectory with positive initial concentrations 
remains bounded away from the zero set of every associated prime. □ 

Proposition ^. 12l brings binomial primary decomposition to bear on the chemistry of 
mass-action kinetics. The details of how this occurs are illustrated by certain special 
cases of Conj ecture 16.101 whose proofs are known. The characterizations of these cases 
rely on a polyhedral concept hiding in the dynamics. 

Definition 6.13. The stoichiometric compatibility subspace of the reaction system in 
Definition 16.51 is the real span S of the vectors a — b over all reactions A ^ B in the 
system. The stoichiometric compatibility class (or invariant polyhedron) of a species 
concentration vector £ G M n is the intersection of the nonnegative orthant with £ + £. 

Lemma 6.14. Trajectories for any reaction system are constrained to lie in the in- 
variant polyhedron of the vector of initial species concentrations. 

Proof. This is immediate from the equation for x in Definition 16.51 □ 

The known cases of Conjecture 16. lOl include all systems whose initial species concen- 
tration vectors have invariant polyhedra of dimension 2 or less |AS09[ Corollary 4.7]. 
In the language of Proposition 16.121 the method of proof is to bound all trajectories 
away from the zero set of every associated prime whose intersection with the relevant 
invariant polyhedron is 

• a vertex [And081 ICDSS09j or 

• interior to a facet |AS09j . 
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For a comprehensive review of known cases of the Global Attractor Conjecture, see 
|A509l §1 and §4]. 

The combinatorics of binomial primary decomposition might contribute further 
than merely the statement of Proposition 16.121 For example, mesoprimary decompo- 
sition (Definition I3.23( see |KM10j ) provides decompositions of binomial ideals over 
the rational or real numbers, and therefore takes steps toward primary decomposition 
over the reals. Mesoprimary decomposition also characterizes the associated lattices 
combinatorially, without a priori knowing the primary decomposition. Both could 
be important for applications to the Global Attractor Conjecture: perhaps finiteness 
conditions surrounding associated lattices (Example I3.24p indicates how to produce 
the desired trajectory bounds, with the reality (i.e., defined over M) of the components 
forcing progress away from the boundary, as opposed to (say) periodicity of some kind. 

In an amazing convergence, graphs associated to event systems [AGHMR091 Def- 
inition 2.9] provide chemical interpretations of the graphs Gi from Sections [2] and |3] 
(particularly Definition 12 . 1 6[) in this survey. Reversibility of the reaction system 
means that it is correct for Gj to be undirected. The characterization of naturality in 
|AGHMR09t Theorem 5.1] is a condition on the primary decomposition of the event 
ideal. This convergence is cause for optimism that lattice-point point combinatorics 
will be instrumental in proving Conjecture 16.101 via Proposition 16.121 
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